\, 


■^i^.<if:jy- 


^ 

^HK 


i'f.j^\4 


^ 


n( 


t 


l\ 


R<Hv 


W'- 


-V■J^•  1,  s  «* 


^; 


TT 


■*.")*  t'- 


tisr- 


Vy  ^ 


1^ 


f^ 


>*.>•'■ 


.A."5|Lfol*" 


V 


A 


( .  '         y 


^~f 


>k^Trv^i, 


>*4' 


t>-^v 


*v 


^'^^L-^ 


V*A 


«^H^- 


j*cf^jr<^wc» 


*^- 


/^, 


<k 


'T* 


-yV^ 


-\^ 


■^^^ 


1 1(4^. 


^*n 


.I'j^'^ 


'^.M 


7*-flM"'jl 


') 


^^^^XJ: 


?rAj:4 


'/.- 


if***- 


tSi.'^ 


:^^j>T 


^^ 


,? 


/j 


i 


if 


ROTHAMSTED     MEMOIRS 

(FROJI    THE      PHILOSOPHICAL  TJ{AXSA('T10NS.') 


Voi.ltmf;  IV. 
1!)14_19:^ 

(<iii;iil()   Series;') 


ROTHAMSTED  EXPERIMENTAL  STATION,  HARPENDEN. 


ST.  ALBAN'S: 

IJIBBS    AND  BaMFUKTH    LTD..    SPENCER    STREET 

1934. 


ROTH AM ST ED     MEMOIRS 


CONTENTS  OF  VOLLME  IV.  (Quarto  Series). 

PnbUBliwl 

1.  "  Some  Notes  on   Soil  Protozoa."     C  H.   Martin  and  K.   R.   Lbwin. 

Pliil.   Trans.    B.   Vol.   Ci^Y..   pp.   77-94       ...  ...  ...  ...       19U 

2.  "  On  a   Foiiu   ot   Botri/tis  ('itterca,   with  colourless  Sclerotia."     W.  B. 

Beierlry.    Phil.  Trans.  B.  Vol.  OCX.,  pp.  83  lU.    ...  ...  ...       1920 

3.  •'  On   tlic  .Mathematical    I'miiidalinns  oC  Tiieoretical  Statistics."     R.  A. 

FiSHRR.     Phil.  Trans.  A,  Vol.  CCXXII..  pp.  309  3fiS.  ...  ...       1922 

4.  "  A  Quantitative   Investigation   of   llic  Bacterial  and  Protozoan  Popu- 

lation of  the  Soil,  with  an  Account  of  the  Protozoan  Fauna."  D.  Ward 
Cutler.  L.  M.  <'rump  anrl  FT.  Sando.v.  Phil.  Trans.  B.  Vol.  C(-XI., 
pp.  317  .S.".0.  ...  ...  ...  ...  ...  ...  ...       1922 

~).    "  The  InHiience  of  Rainfall  on  the  Yield   of   Wheat   at    Rothamsted." 

R.  A.  Fisher.    Phil.  Trans.  R,  Vol.  CCXITI..  pp  89142.        ...  ...       1924 

0.  "  The  Interaclion  of  Clay  willi  Watei-  ;uid  Organic  Li()uids  as  Measured 
by  Specific  Volume  Chan}>es  aufl  its  Relation  to  the  Phenomena  of 
Crumb  Foi'mation  in  Soils."  E.  W.  RussBix.  Phil.  Trans.  A,  Vol. 
Ci^XXXTTl..  pp.  .3(i1  .3S9.  ...  ...  ...  ...  ...  ...       1934 


Tiie  foUowiufi-  paper  is  not  included  in  the  Volume,  as  it  also  appeared 
in  Jour.  Aoric.  Sci..  Vol.  VI..  pp.  278.301  (1914).  See  Rothamsted 
Memoirs.    Vol.    VI..   Oclavo   Scries:  — 

"  Tile  Soil  Solulion  and  the  :Mineral  Constituents  of  the  Soil."  A.  D. 
Ham,.  W.  K.  liRRXCHi.RV  and  L.  :\1.  CxDERWOon.  Phil.  Trans.  B.  Vol. 
CCIV..  pp.  ITItl'OO.  ...  ...  ...  ...  ...  ...        1914 


[    77    ] 


III.  Some  Notes  on  Soil  Protozoa. 

By  C.  H.  Martin,  M.A.,  and  K.  E.  Lewin,  B.A.,  Rothamsted  Experimental 

Station. 

Communicated  by  Horace  T.  Brown,  F.R.S. 

(Received  December  5,  1913, — Read  February  5,  1914.) 

[Plates  5  and  6.] 

Contents. 

PAGE 

I.  Introduction 77 

II.  Methods  of  Preparation  and  Culture 80 

III.  Cucumber  bed — Vahlkampjia  soli,  n.  sp.,  Aniceha  cucumis,  n.  sp 82 

IV.  Seedling  bed — Amwha  gohanniensis,  n.  sp 89 

V.  Conclusions 91 

VI.  Literature  List 92 

VII.  Description  of  Plates 93 

Introduction. 

It  is  rather  a  curious  fact  that  it  has  been  left  to  agricultural  chemists  to  bring 
into  prominence  the  important  part  that  free-living  Protozoa  may  play  in  the  soil. 
At  the  present  time  it  seems  to  us  that  the  prevalent  idea  as  regards  the  distribution 
of  free-living  protozoa  is  that  they  can  exist  in  the  trophic  state  only  in  definite 
accumulations  of  fresh  water  {e.g.,  pools,  rivers,  lakes)  or  in  the  sea.  It  seems  also  to 
be  generally  held  that  after  the  drying  up  of  small  pools  the  cysts  of  protozoa  are 
transported  by  currents  of  air,  and  that  it  is  to  these  wind-borne  cysts  that  the 
prevalence  of  protozoa  in  cultures  of  ordinary  soils  may  be  mostly  attributed.  It  is 
very  difficult  to  trace  the  historical  origin  of  this  view,  but  it  is  evident  that  to  the 
early  workers  on  protozoa  the  idea  of  this  limitation  of  their  active  life  to  water  was 
unthought  of 

It  is  unnecessary  to  refer  here  to  the  works  of  very  early  writers,  by  whom  the 
spread  of  disease  was  sometimes  attributed  to  invisible  forms  of  life  present  in  the  air. 
It  would  seem,  however,  from  the  following  quotation  that  Ehrenberg  held  a 
far  wider  view  as  regards  the  distribution  of  protozoa  than  that  which  is  fashionable 
at  present.  In  his  great  work  '  Die  Infusionsthierchen  als  Vollkommene  Organismen,' 
p.  496,  he  says  : — - 

"  Jeder  der  bekannten  lebenden  Korper  besitzt  eine  Orgauisations-Feuchtigkeit. 
So  lange  er  diese,  den  ihn  bestiirmenden  physikalischen  Naturkriiften  entgegen- 
kampfend,  in  seinen  Hauptorganen  erhiilt,  so  lange  ist  er  lebend  ;  sobald  sie  durch 

VOL.   CCV. B.    315.  Piiblislied  separately,  May  30,  1914. 
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HItze  Frost  oder  eigene  Schwiiche  verloren  geht,  oder  durch  und  durch  erstarrt, 
erfolgt  der  Tod,  der  auch  auf  manche  audere  Weise  eintretea  kann.  Diese  Organisa- 
tions-Feuchtiskeit  nehmen  Kiiferlarven  im  diirrsten  Holze,  Mottenlarven  im  diirrsten 
Pelze,  Infusorien  und  Mooswurzeln,  Sameii,  dergl,  im  diirrsten  Sande  aus  dem 
Dunste  der  Atmosphare  in  sich  auf,  bleiben  fleischig  und  feucht  und  niissen  sogar 
ihre  Umgebung.     Lebende  Dammerde  bleibt  feucht." 

It  is  important  to  note  that  at  the  time  this  work  was  written  the  cysts  of  Protozoa 
were  still  unknown,  and  that  therefore,  for  Ehrenbebg,  life  meant  trophic  life,  and 
not  latent  life  in  the  cyst. 

By  later  workers  this  wide  view  of  the  distribution  of  protozoa  seems  to  have 
become  gradually  more  limited.  For  example,  Dujardin  in  his  account  ol  the 
protozoa  in  the  '  Histoire  Naturelle  des  Zoophytes,'  1841,  p.  169,  states  :— 

"  Certains  Infusoires  vivent,  non  pas  simplement  dans  les  eaux,  mais  dans  les  sites 
habituellement  humectes,  comme  les  touffes  de  mousses,  et  surtout  les  couches  minces 
d'oscillaires,  sur  la  terre  ou  sur  les  murs  humides ;  pour  les  trouver,  il  suffit  d'agiter 
et  de  presser  dans  un  vase  d'eau  successivement  jjlusieurs  touffes  de  mousse  prise  au 
pied  des  arbres,  dans  les  lieux  frais,  ou  au  bord  des  ruisseaux  ;  ou  bien  de  placer  dans 
una  soucoupe,  avec  un  peu  d'eau,  la  pellicule  enlevee  a  la  sui'face  du  sol  convert 
d'oscillaires." 

From  this  quotation  it  is  evident  that  Dujardin  regarded  the  existence  of  the 
Protozoa  in  their  trophic  state  in  some  saturated  soils  as  certain,  but  even  in  the  case 
of  saturated  soils  later  writers  seem  to  have  believed  that  the  forms  met  under  these 
conditions  were  mostly  present  as  cysts.  It  is  to  Stein  that  we  owe  the  discovery 
that  nearly  all  free-living  protozoa  can  encyst,  and  it  is  probably  to  him  also  that  we 
owe  the  present  limitation  of  our  view  as  to  the  distribution  of  protozoa.  On  p.  20 
of  his  work  '  Der  Organismus  der  Flagellaten '  (1878)  he  states  : — 

"  Die  encystirten  Infusionsthiere  erwachen,  wenn  sie  friiher  oder  spiiter  unter 
Wasser  gesetzt  werden,  zu  neuem  Leben  und  durchbrechen  meist  schon  nach  wenigen 
Stunden  ihre  sich  erweichende  und  aufquellende  Cyste.  Sie  konnen  aber  auch  in 
trockenem  Zustande,  gleich  PHanzeusamen  und  Pflanzensporen,  durch  die  Winde  auf 
weite  Entfernungen  bin  fortgcfuhrt  und  auf  den  verschiedensten  Gegenstiinden,  z.  B. 
auf  dem  ausgebreiteten  lieu  der  Wiesen,  auf  den  Moosilberziigen  alter  Gemiiuer, 
zwi.schen  der  rissigen  Rinde  alter  Bilume  oder  im  Sande  der  Ditcher  abgesetzt  werden. 
Uebergiesst  man  dann  solche  Gegenstiinde  mit  Wasser,  so  liefern  die  daran  haftenden 
Cysten  ebenfalls  schon  nach  wenigen  Stunden  die  freie  bewegliche  Infusorienform." 

This  view  seems  to  have  ap])ealed  to  KtrrscHLl  since  in  his  work  on  Protozoa 
(I  Abtheiluug,  Sarcodina  und  Sporozoa,  1880-82)  he  states  on  p.  1G2  : — 

"  So  treffen  wir  Amciben  in  feuchtem  Sand  oder  Moos  von  Biiumen,  und  zwar  sowohl 
am  Kusse  soldier  als  in  ziemlicher  Hohe  iiber  dem  Erdboden  an     ...     . 
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"  Das.s  es  sich  in  diesen  Fallen  meist  urn  Formen  handelt,  die  dnrcli  Winde  im 
encystirten  oder  zum  Theil  vielleicht  auch  nicht  encystirten  Zustand  gewissermaassen 
verschlagen  wnrden,  diirfte  keineni  Zweifel  unterliegen. 

"  In  dieselbe  Kategorie  diirfen  wir  vielleicht  auch  die  von  Cienkowsky  auf 
Pferdemist  beobachtete  Diplophrys  stercoreum  und  das  uuter  ahnlichen  Verhiiltnissen 
getroffeue  Plutoum  stercoreum,  sowie  den  jedenfalls  zur  gleichen  Gattung  gehorigen, 
von  Gabriel  in  feuchter,  mit  thierischen  Excrementen  diirchsetzter  Erde  gefundenen 
soo-en.  Trocdodvtes  rechnen,  deren  niichste  Verwandte  ja  das  siisse  Wasser 
bewohnen." 

Finally  it  is  to  this  view  of  the  distribution  of  protozoa  that  we  probably  owe  a 
recent  interesting  paper  by  E.  M.  Puschkarew.*  Puschkarew  left  sterilised  plates 
exposed  to  the  air  and  examined  the  cultures  that  he  so  obtained.  Of  the  13  forms 
he  obtained  in  this  way  all  can  be  found  in  soil,  but  it  seems  fairly  clear  that  he 
believed  that  nearly  all  these  forms  in  his  cultures  owed  their  origin  to  cysts  from  dried 
pools  on  the  banks  of  the  river  which  had  been  transported  by  the  wind,  cp.  p.  331. 

"  Herbst  und  Winter  1910/11  wareu  nun  zum  Teil  sehr  regnerisch  und  also 
ungunstig  fiir  meine  Arbeit.  Nun  kam  aber  das  Friihjahr  1911  mit  relativ  wenig 
Regen  und  endlich  der  Sommer  mit  seiner  grossen  Hitze  ;  viele  Sihupfe  waren  ganz 
ausgetroeknet ;  das  Wasser  in  Fliissen  und  Seen  stand  kolossal  niedrig  ;  der  meistens 
mit  Alaren  bewachsene  Fluss-  und  Seeboden  war  zum  Teil  von  Wasser  befreit ;  die 
ausgetrockneten  Algen  waren  mit  Protozoencysteu  bedeckt  und  die  Luftstromungen 
konnten  letztere,  zusammen  mit  feinen  Schlammteilchen,  vom  Boden  in  die  Hohe 
reissen  und  zu  entferuten  Orten  transportiereu." 

Of  recent  years  a  large  amount  of  our  knowledge  of  that  difficult  group  the  Amoebae 
has  been  obtained  from  animals  cultivated  from  soils  (Nagler  and  Glaser),  but  it  is 
interesting  to  note  that  little  if  any  attention  has  been  paid  to  the  biology  of  the 
forms  thus  obtained,  in  their  natural  state  ;  and  we  believe  that  to  most  biologists  the 
idea  that  ordinary  soils  contain  a  lai'ge  number  of  protozoa  in  an  active  condition  is  a 
new  one.  The  view  that  the  presence  of  such  protozoa  in  excessive  numbers  may  be 
the  cause  of  soil  sickness  was,  we  believe,  first  put  forward  by  Pi.ussell  and 
Hutchinson  in  a  paper  "  On  the  Effect  of  Partial  Sterilisation  of  Soil  on  the 
Production  of  Plant-Food." t 

In  working  with  these  soil-protozoa,  so  long  as  one  was  confined  to  culture  methods 
it  was  obviously  impossible  to  decide  how  far  any  given  culture  of  protozoa  gave  a 
true  pictui-e  of  the  life  in  a  soil,  and  how  far  the  organisms  in  the  culture  were  simply 
derived  from  cysts.  There  were,  of  course,  certain  indications,  as  one  of  us  has 
pointed  out  in  a  previous  paper,  that  certain  faunas  were  connected  with  certain  soils, 

*  "  Ueber  die  Verbreituiig  der  Siisswasserprotozoeu  durch  die  Luft,"  '  Archiv  fiir  Protistenkunde,' 
Band  28. 

t  'Journal  of  Agricultural  Science,'  vol.  3,  Part  II  (1909). 
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but  even  then  one  was  left  absolutely  in  the  dark  as  to  what  degree  of  saturation  of 
water  was  necessary  in  these  soils  before  the  animals  found  in  the  cultures  could  lead 
an  active  life  in  the  soil. 

By  a  method  which  one  of  us  has  described  in  a  recent  letter  to  '  Nature,'  No.  22G6, 
vol.  91,  1913,  and  which  is  given  in  detail  below,  it  is  possible  to  obtain  results  which, 
though  minimal,  are  definite  as  to  the  state  of  the  fauna  in  any  given  soil  at  any 
moment.  Up  to  the  present  the  only  soils  we  have  examined  thoroughly  by  this 
method  are  (1)  a  cucumber  bed,  and  (2)  a  seedling  bed  obtained  from  the  layer  of  soil 
underneath  the  turf  in  a  neighbouring  field  and  free  from  any  admixture  of  manure. 

In  this  paper  we  propose  giving  some  account  of  the  organisms  found  in  an  active 
state  in  these  two  soils  by  this  method,  together  with  some  further  details  on  the 
life-cycles  of  these  forms  in  cultures.  In  a  future  paper  we  hope  to  return  to  this 
question  of  the  existence  of  active  protozoa  in  the  soil  in  greater  detail. 

We  should  like  to  take  this  opportunity  of  thanking  Dr.  Ettssell  for  the  great 
interest  wliich  he  has  taken  in  this  work.  We  wish  also  to  thank  Prof  MiNCHiN 
for  his  great  kindness  in  allowing  us  to  write  up  our  results  in  his  department  at  the 
Lister  Institute,  and  Miss  Rhodes  for  the  trouble  she  has  taken  over  the  drawings. 

Methods  of  Preparation  and  Culture. 

The  only  method  by  which,  so  far  as  we  are  aware,  the  presence  of  active  protozoa 
in  the  soils  can  be  satisfactorily  demonstrated  is  one  which  one  of  us  has  recently 
described  in  a  letter  to  '  Nature.'  For  this  purpose  a  small  quantity  of  the  soil  to  be 
examined  is  added  to  about  an  equal  quantity  of  a  saturated  aqueous  solution  of 
picric  acid  ;  the  mixture  is  then  stirred  very  thoroughly,  so  that  the  protozoa  which 
are  situated  on  the  liquid  films  between  the  soil  particles  are  freed.  The  mixture  is 
then  allowed  to  stand  for  some  time  (12-24  hours),  and  it  will  be  found  that  the 
scum  which  rises  to  the  surface  contains  a  proportion  of  the  bacteria  and  protozoa 
of  the  soil.  Cover-slip  preparations  can  then  be  made  by  floating  cover-slips 
upon  the  surface  of  the  mixture.  For  the  purpose  of  this  work  the  cover-slips  which 
one  of  us  described  in  a  paper  on  "  A  Note  on  the  Protozoa  from  Sick  Soils,  with 
some  Account  of  the  Life-Cycle  of  a  Flagellate  Monad,"*  are  very  convenient.  The 
cover-slip  smears  thus  obtained  can  be  put  in  corrosivef  or  simply  washed  out  in 
70-per-cent.  alcohol  and  then  stained  and  mounted  in  the  ordinary  way.  If  the 
organisms  in  the  soil  are  scarce  it  will  be  found  an  advantage  for  purposes  of 
searching  to  over-stain  in  eosin. 

There  can  be  little  doubt  that  this  method  is  not  an  ideal  one,  and  that  it  only 
gives  minimal  results,  still  in  suitable  soils  quite  good  results  can  be  obtained, 
for  example,  in  the  cucumber  soil,  the  fauna  of  which  is  described  below,  up 
to  150  Amoebae  have  been  obtained  on  a  single  film.     It  is  to  be  hoped,  however, 

*  'Roy.  Soc.  Proc.,'  B,  vol.  85. 

t  Vide  Mann's  'Physiological  Histology,'  p.  71. 
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that  in  the  near  future  some  better  method  will  be  I'uund  for  dealing  witli  this 
problem,  more  particularly  in  poor  clay  soils,  though  at  present  we  must  confess  it  is 
very  hard  to  discover  one. 

It  would  we  feel  be  premature  at  present  to  attempt  a  formal  list  of  the  culture 
media  on  which  soil  protozoa  flourish.  In  all  cases  of  cultures  of  soil  protozoa,  so  far 
as  we  are  aware,  up  to  the  present,  as  Vahlkampf  clearly  insisted  in  his  paper  on 
the  biology,  etc.,  of  Amoeba  Umax*  the  protozoa  feed  upon  the  bacteria  of  the 
culture,  and  hence  almost  any  culture  media  on  which  soil  bacteria  flourish  will 
probably  support  a  large  number  of  protozoa. 

Therefore  in  those  cases  in  wln'ch  the  expression  "  pure  animal  culture"  is  used  we 
only  wish  to  indicate  that  the  culture  contained  only  one  form  of  pi'otozoon,  though 
of  course  it  contained  large  numbers  of  bacteria.  It  may  of  course  be  possible  in  the 
future  to  obtain  cultures  of  some  saprozoic  protozoa  free  from  bacteria,  and  in  certain 
cases  we  have  found  indications  that  certain  Amoebse  show  a  distinct  preference  for 
certain  culture  media,  thougii  here,  again,  this  efiect  may  be  a  secondary  one  due  to 
the  encouragement  of  a  certain  type  of  bacteria. 

Up  to  the  present  we  have  mainly  used  solid  media  for  our  cultures,  as  we  find 
that  they  are  far  more  convenient  for  isolating  any  given  form.  We  used  two  types 
of  culture  media,  one  an  ordinary  agar  made  up  of  1000  c.c.  meat  extract  and 
1 5  grm.  of  agar  ;  but  we  have  found  a  culture  medium  of  Friedberger  and  Reiter 
described  in  KoLLE  and  Wassermann's  '  Handbuch  der  pathogenen  Mikro- 
organismen,'  vol.  1,  gives  very  good  results  for  most  soil  protozoa;  it  consists  of 
a  horse-dung  agar  made  up  of  three  lumps  of  horse  dung  and  500  c.c.  of  water,  this 
mixture  Is  boiled  for  one  and  a  half  hours,  then  filtered  through  cloth,  and  finally 
about  y  grm.  of  agar  is  added.  In  many  cases  where  it  is  used  to  get  a  very  strong 
growth  of  protozoa  it  is  advisable  to  add  a  small  amount  of  water  or  dilute  albumen 
to  the  culture  plates  to  about  a  depth  of  2  mm.  This  addition  of  water  seems 
to  obviate  the  vacuolated  appearance  which  some  workers  have  noted  as  characteristic 
of  culture  Amoebae  on  plates. 

The  stock  cultures  are  made  up  by  adding  a  little  soil  directly  to  the  plates.  If 
these  stock  cultures  are  examined  from  time  to  time  it  will  be  found  that  in  any 
given  culture  there  is  a  more  or  less  definite  succession  of  animal  forms.  By 
selecting  the  time  and  method  of  culture  it  will  probably  be  found  possible  to 
get  pure  animal  cultm^es  of  any  of  these  forms. 

In  this  connection  it  may  be  interesting  to  note  the  results  obtained  from  two 
cultures  which  would  seem  to  indicate  that  the  protozoa  in  the  soil  are  to  a  great 
extent  feeders  on  bacteria,  and  that  the  number  of  bacteria  in  the  mud  from  ordinary 
fresh  water  pools  does  not  seem  to  oft'er  equal  scope  for  true  feeders  on  bacteria. 
Two  cultures  were  made  from  the  cucumber  soil  on  manure  agar  to  which,  in  place 
of  water,  Molisch's  solution  was  added.      In  these  cases  the  algse  got  the  upper  hand 
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of  the  bacteria  on  the  culture,  and  possibly  as  a  result  of  this  the  culture  from 
a  protozoon  point  of  view  was  a  very  poor  one.  The  counter-experiment  was  made 
with  some  of  the  cultures  of  Amoeba  proteus,  which  were  placed  with  some  mud  on 
manure-agar  plates,  and  here  the  bacteria  did  not  develop  to  anything  like  the 
extent  to  which  they  would  have  done  from  an  ordinary  soil  culture,  and  none  of  the 
ordinary  soil  protozoa  were  found. 

Cucumber  Bed. 

In  July,  1913,  through  the  kindness  of  a  grower  in  the  neighbourhood  of 
Rothamsted  we  had  an  opportunity  of  examining  a  sick  cucumber  border.  In  order 
to  give  a  fair  picture  of  the  very  special  conditions  prevailing  in  these  cucumber  beds 
we  have  thought  it  well  to  give  the  history  of  this  particular  border  bed  from 
its  origin.  It  was  made  up  in  February,  1912,  of  one  part  by  volume  of  light 
pasture  soil,  one  part  of  heavy  pasture  soil,  and  two  parts  by  volume  of  horse  manure. 
During  the  season  (April  to  June  15)  the  beds  were  either  mulched  and  given  in 
alternate  weeks  a  small  dose  of  chemical  food  or  were  given  a  dressing  of  soil,  cow- 
dung,  and  soot.  After  June  15  the  same  treatment  was  applied  with  longer 
intervals.  Towards  the  end  of  the  year  two-thirds  of  the  bed  was  removed  and 
replaced  by  horse  manure.  On  December  27,  1912,  the  bed  was  steam  sterilised 
(20  minutes  on  a  steam  grid)  and  then  treated  as  in  the  previous  year.  The  water 
content  on  the  date  on  which  we  first  examined  the  bed  (August  1,  1913)  was  in 
tlie  top  layer  62  per  cent,  by  weight,  and  in  the  bottom  layer  55  per  cent.  On 
October  7,  1913,  the  water  content  was  57  per  cent,  in  a  sample  from  the  bottom. 
Probably  the  water  content  does  not  vary  much  in  these  beds.  At  first  sight  this 
water  content  may  seem  high  to  those  who  have  had  no  experience  of  estimations  of 
soil  moisture,  but  the  soil  at  the  time  of  examination  was  only  damp  to  the  touch 
and  there  was  no  sign  of  excessive  water. 

Preparations  were  made  of  the  fresh  soil  by  the  picric  acid  method  described  above, 
and  the  lollowiiig  organisms  were  discovered  in  the  trophic  condition  on  the  films  so 
prepared  : — 

1.  I<Ai<ilyj>ha,  ?sp.  (Plate  5,  fig.  1).  5.  Flagellate,  ?gen.  (Plate  5,  fig.  3). 

2.  Trinema,  ?  sp.  G.  Chilodon  sp.  (Plate  5,  fig.  2). 

3.  VaJilkampfia  soli.,  n.  sji.  (Plate  5,  fig.  10).  7.  Ciliate,  ?  gen. 

4.  Amwba  cucumis,  n.  sp.  (Plate  (5,  fig.  20).  8.  Ciliate,  ?  gen. 

Of  these  the  thecanioebse  were  probably  present  in  this  soil  in  considerable  numbers, 
more  than  is  indicated  by  the  number  of  forms  obtained  on  the  films. 

This  is  probably  due  to  the  fact  that  under  the  most  ftivourable  circumstances  it  is 
rather  difficult  to  get  good  preparations  of  these  thecamoebse  from  a  numerical  point 
of  view  by  the  cover-slip  method. 

The  Amoebaj  were  present  on  the   films  in  large   numbers,    e.g.,  up  to   150  good 
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specimens  in  an  active  trophic  condition,  with  large  numl)ers  of  ingested  bactei'ia, 
were  counted  on  a  single  film.  These  were  probably  the  dominant  protozoa  in  this 
soil  during  the  time  it  was  under  examination. 

The  flagellates  were  very  rare  on  these  films,  and  this  was  rather  curiou.s,  as  in 
cultures  from  this  soil  they  were  veiy  numerous  (Prowazekia  and  Copromonas, 
etc.).  It  would  seem  tliat  this  result  can  be  explained  by  one  of  the  two  following 
hypotheses  : — 

(ffl)  The  films  gave  a  true  picture  of  the  life  in  this  soil,  and  flagellate  forms  were 
not  present  in  large  numbers  in  the  trophic  condition,  possibly  because  the  water- 
content  of  the  soil  was  not  large  enough  to  allow  them  sufficient  liberty  for  long  free 
paths  ;  or 

[h)  The  flagellates  were  present  in  an  active  condition,  but  the  films  did  not  give  a 
true  picture,  because  the  long  flagella  might  have  become  entangled  in  the  soil 
particles,  and  thus  have  jirevented  the  flagellates  being  carried  to  the  surface-film. 

At  present  we  are  inclined  to  accept  the  former  of  these  views,  because  in 
preparations  made  in  the  above  manner  from  other  soils  we  have  found  fair  numbers 
of  flagellates  in  the  trophic  phase. 

The  ciliates  in  these  preparations  ai'e  also  very  rare,  and  here  it  seems  probable 
that  their  scarcity  on  the  films  rejjresents  fairly  the  state  of  affairs  in  this  soil.  It 
is  possible  that  in  absolutely  saturated  soils  ciliates  may  play  an  active  part  as  a 
bacterial  check,  but  it  is  difficult  to  believe  that  they  can  exercise  an  important 
rdle  in  a  sick  soil-bed  like  the  one  under  examination.  On  three  occasions  ciliates 
were  found  in  fresh  soil-smears  :  one  appeared  to  be  a  Cliilodon  (Plate  5,  fig.  2),  and 
the  two  other  forms  have  not  been  identified  with  any  certainty. 

Of  these  forms  we  only  propose  to  deal  at  length  with  the  two  Amoebas,  since  not 
only  were  they  probably  the  most  important  checks  on  the  bacteria  in  this  soil  at  the 
time  of  examination,  but  they  also  gave  the  most  interesting  results  on  the  cultures. 

Vahlkampfia  soli  (n.  sp.). 

To  judge  from  the  evidence  of  the  fresh  films,  this  must  liave  been  the  dominant 
form  in  the  cucumber  bed  during  the  mouth  of  August.  Although  this  animal  agrees 
in  many  important  points  with  the  Amoeha  Umax  of  Vahlkampf's  classical  paper,  yet 
there  are  certain  small  points  of  difference  which  seem  to  justify  the  formation  of  a 
new  species  for  the  reception  of  these  forms. 

In  life  Vahlkampfia  soli  is  a  very  active  form,  showing  the  characteristic 
movements  of  an  amoeba  of  the  Umax  group.  In  progression  a  single,  large 
pseudopodium,  composed  almost  entirely  of  ectoplasm,  is  usually  thrown  out,  and  the 
granular  entoplasm  seems  to  stream  into  it.  Very  often  the  posterior  end  shows  a 
characteristic  tufted  appearance,  and  it  is  interesting  to  note  that  this  appearance  is 
often  found  in  animals  on  fresh  smears  (Plate  5,  fig.  10),  showing  that  these  forms 
were  actively  moving  through  the  soil  at  the  time  they  were  fixed  by  the  addition  of 
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picric  acid.  A  contractile  vacuole  is  present,  and  is  usually  discharged  at  the 
posterior  end.  The  period  of  active  life  on  these  culture-plates  at  18°  C.  seems  to  be 
about  3-4  days.  In  life  certain  highly  refractile  granules  are  fairly  frequent  in  the 
endoplasm,  especially  of  young  forms.  In  fresh  smears  prepared  from  this  cucumber 
soil  by  the  picric  acid  method,  the  trophic  stages  of  this  animal  were  very 
abundant  during  the  month  of  August.  The  forms  thus  found  were  usually  very 
active,  and  contained  large  numbers  of  ingested  bacteria  (Plate  5,  fig.  lO).  On  some 
smears  indications  of  division  were  found  (Plate  5,  fig.  4),  thus  clearly  showing  that 
Vahlkamjjfia  soli  flourished  in  tliis  soil  at  this  date.  Vahlkamj)fia  soli  was  found  to 
do  fairly  well  on  manure-agar  cultures  with  water,  and  we  also  tried  plates  made  up 
of  a  straw  decoction,  which  had  been  recommended  by  Vahlkampf.  In  stanied 
forms  from  young  cultures  the  nucleus  is  a  spherical  body,  with  a  large,  darkly- 
staining  karyosome  of  rather  elliptical  form.  The  membrane  is  a  fairly  definite 
structure,  and  ranged  upon  it  there  appear  to  be  a  number  of  masses  staining  with 
eosin.  These  forms  from  young  cultures  corresponded  in  every  way  with  the  forms 
found  in  the  fresh  films,  and  it  would  seem  that  tliis  appearance  of  the  nucleus  is 
correlated  with  the  intense  reproductive  activity  characteristic  of  this  stage.  In 
older  cultiu-es  the  karyosome  is  rounder,  smaller,  and  denser  ;  some  of  the  stages  of 
division  appear  to  resemble  very  closely  stages  figured  by  Glaser  in  the  division  of 
Anmha  tachypodia,  and  probably  some  of  the  stages  figured  by  Aragao  for  Amoiha 
dvplomitotica. 

It  is  highly  probable,  iiowever,  that  there  are  a  very  large  number  of  amoebse  for 
which  this  statement  will  l)t*  found  true. 

As  regards  the  behaviour  of  the  cytoplasm  during  division,  it  is  perhaps  worth 
noting  that  the  rounding  up  which  Glaser  has  stated  to  be  always  characteristic  of 
dividing  amoebaB  is  not  seen  in  this  form,  and  that  the  stage  of  nuclear  division  does 
not  seem  to  bear  an  absolutely  fixed  relation  to  the  progress  of  the  cytoplasmic 
division. 

The  nuclear  division  of  this  amoeba  belongs  to  that  very  difficult  type,  for  which 
NagLER  has    invented    the   term   of   promitosis.      In    the    early   stages    of   division 
the   karyosome  swells  and  becomes  very  elliptical,  and  the  mass  of  the   chromatin 
then   gets    concentrated   at   either    pole    of  the    elongated    karyosome.      There  can 
be   no   doubt  that  the  division    of  the  arreat  mass   of   chromatin    contained   in   the 
resting  nucleus  is  effected  by  this  simple  means.     Tlie  question  of  chromosomes  in 
this  division  is  an  extraordinarily  difficult  one,  and  one  camiot  help  feeling  that  in 
divisions  of  this  kind  the  result  arrived  at  depends  nuich  upon  the  technique  adopted, 
and    in    many    cases   it   would    appear    that    too    much    reliance    is    placed   on   iron 
hsematoxylin   preparations  alone.      In  this  case  we  have  tried  the  combined   results 
obtained  from  iron  hajmatoxylin  and  ha^malum  preparations,  though  we  decided,  as  a 
matter  of  caution  only,   to   Hgure  the   hsemalum  preparations.     At  a  slightly  later 
stage  it  would  appear  that  the  median  portion  of  the   karyosome  becomes  broken  on 


MESSRS.  C.  H.  MARTIN   AND   K.  R.  LEWIN:    SOME  NOTES   ON   SOIL   PROTOZOA.     85 

one  side  into  a  mass  of  irregular  granules  (Plate  5,  fig.  11).  This  stage  seems  to 
correspond  in  some  respects  with  that  figured  by  Glaser  for  Amoeba  tachyjjodia,  but 
in  this  case  he  is  inclined  to  refer  the  origin  of  these  granules  to  extra-karyosomic 
chromatin.  The  dumb-bell-shaped  karyosome  is  now  broken  through  (Plate  5, 
figs.  12  and  13),  and  the  two  polar  masses  gradually  pass  to  the  opposite  extremities  of 
the  dividing  animal  (Plate  5,  fig.  14).  The  connecting  bar,  which  seems  to  consist  of 
a  mixture  of  the  chromatin  granules  mentioned  above  and  an  achromatic  mass, 
becomes  gradually  thinner  at  its  middle,  and,  as  by  this  process  chromatin  granules 
are  brought  much  closer  together,  it  becomes  at  the  same  time  much  more  con- 
spicuous (Plate  5,  fig.  15).  Diu'ing  this  stage,  for  some  unknown  reason,  the 
chromatin  becomes  massed  in  three  volumes,  viz.  the  two  polar  masses,  and  the 
median  portion  of  the  connecting  bar,  so  that  the  characteristic  appearance  shown  in 
Plate  5,  fig.  15,  is  originated  by  two  polar  masses,  an  intervening  clear  space,  and  a 
darkly-stained  central  bar.  The  daughter  nuclei  now  become  separated  through  the 
snapping  of  this  connecting  bar,  and  in  the  reconstruction  of  the  nuclei  it  would 
appear  that  the  wedge-shaped  mass  derived  from  the  connecting  bar  loses  its 
chromatin  in  the  form  of  irregular  granules,  which  fuse  with  the  karyosome  (Plate  5, 
fig.  16).  Jt  will  be  obvious  from  this  account  that,  though  we  difler  with  Glaser 
over  some  slight  points  of  detail,  yet  in  this  form  also  we  can  find  no  trace  of  the 
existence  of  a  centriole. 

In  life  the  cysts  of  these  amoebfe  are  very  characteristic  objects  on  the  culture- 
plates.  They  have  an  outer  gelatinous  coat,  to  which  foreign  objects  become 
attached,  and  an  inner  resistant  wall  of  a  yellowish  colour.  The  cysts  are 
frequently  grouped  together  in  a  culture,  and  this  feature  appears  to  be  somewhat 
important,  as  upon  it  some  of  the  earlier  workers,  e.g.,  Zopf,  place  similar  forms  in 
the  group  of  Mycetozoa.  In  his  accovnit  of  Copromyxa  'prolca  in  '  Die  Pilzthiere 
oder  Schleimpilze,'  Zopf  states  on  p.  133  : — 

"  Da  auf  trocknen  Mistculturen  die  Amceben  gewohnlich  nicht  alle  neben,  sondern 
zum  Theil  fiber  einander  kriechen,  so  entstehen  die  eingangs  erwiihnten  Sporen- 
haufchen  (Sori)  (fig.  31,1,  II)." 

The  mature  cysts  are  extraordinarily  difiicult  to  stain.  It  seems,  however,  from 
early  stages  of  encystation,  such  as  is  shown  in  Plate  6,  fig.  19,  that  there  is  a 
single  central  nucleus,  and  we  have  no  reason  to  suppose  that  the  cyst  is  anything 
but  a  protective  cyst.  Under  ordinary  cultural  conditions  tliis  form  emerges  from 
the  cysts  as  a  typical  /iHW(X'-amoeba  ;  in  the  process  of  excystation  the  cyst  swells 
considerably  and  the  amoeba  can  be  seen  moving  round  actively  in  the  cyst  for  some 
time  before  it  emerges  through  a  small  opening  in  the  cyst  wall. 

The  Flagellate  Form. — In  1910,  Wasielewski  and  Hirsghfeld  pointed  out  the 
appearance  of  certain  conditions  of  a  flagellate  stage  of  limax-hmceh^,  and  more 
recently  Whitmore  and  Wherry  have  described  such  stages.     In  the  present  case 
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it  was  found  that,  if  a  plate  of  maniire-agar  were  heavily  infected  with  the  cysts  and 
covered  to  a  depth  of  about  2  mm.  with  tap-water  containing  0-25  per  cent.  NaCl 
and  0-05  per  cent.  MgSO,,  after  about  16-20  hours'  incubation  at  23°  C,  flagellate 
forms  of  this  amoeba  appear  sporadically.  On  three  occasions  an  example  of  the 
flagellate  was  continuously  watched  under  a  2-5  mm.  water-immersion  objective,  and 
was  seen  to  become  amoeboid,  lose  its  flagella  by  absorption,  and  turn  into  a  typical 
specimen  of  the  amoeba.  Permanent  preparations  of  this  transition  have  also  been 
obtained  (Plate  6,  fig.  18).  The  flagellate  stage  (Plate  5,  fig.  17)  is  generally  ovoid, 
with  an  oblique  anterior  end,  but  some  specimens  are  markedly  piriform,  the  pointed 
end  being  anterior.  The  two  flagella  arise  anteriorly  and  are  each  about  equal  in 
length  to  the  body.  The  nucleus  is  situated  near  the  insertion  of  the  flagella,  and 
in  stained  preparations  a  fibril  connecting  the  nuclear  membrane  with  the  common 
root  of  the  flagella  can  be  detected. 

A  contractile  vacuole  is  present,  occurring  laterally  in  the  posterior  third  of  the 
body.  It  arises  as  a  group  of  small  vacuoles,  which  coalesce  to  one  large  one  that 
bulges  out  the  body-wall  and  eventually  bursts  like  a  bubble.  Its  period  is 
two  to  three  minutes. 

The  cytoplasm  is  fairly  homogeneous,  and  at  the  oblique  anterior  end  there  seems 
to  be  a  slight  diff"erentiation  recalling  ectoplasm.  The  flagellate  is  flexible,  but  shows 
no  tendency  to  put  out  pseudopodia. 

Movements. — These  are  quite  rapid,  translocation  being  efiected  by  a  steady 
shouldering  movement,  during  which  the  animal  rotates  on  its  long  axis.  The 
flagella  are  hardly  to  be  made  out  unless  the  flagellate  be  momentarily  at  rest.  They 
appear  to  whip  right  back  to  the  sides  of  the  body. 

Feeding. — Bacteria  are  ingested  at  the  oblique  anterior  surface,  where  a  sort  of 
ectoplasmic  differentiation  is  seen.  They  pass  by  the  nucleus  and  are  finally  digested 
in  the  median  and  posterior  parts  of  the  body. 

Transition  to  Amoeba. — The  flagellate  remains  in  one  place,  and  the  outlines  of  its 
body  appear  slightly  wavy.  Then  it  becomes  rougldy  spherical  and  protrudes 
pseudopodia  in  all  directions.  Almost  at  once  the  spherical  form  is  lost  and  a 
typical  amoeboid  appearance  is  attained.  The  flagella  lose  their  anterior  position 
and  become  lateral,  or  often  terminal,  but  for  about  seven  minutes  emerge  always  in 
the  neighbourhood  of  the  nucleus,  to  which  they  are  attached.  After  this  time  the 
connection  presumably  breaks,  and  the  position  of  the  nucleus  is  no  index  to  that  of 
the  flagella,  whioli  continually  tend  towards  the  hind  end  at  any  moment.  Shortly 
after  attaining  independence,  the  flagella  are  active  only  in  a  minor  degree,  and  soon 
become  motionless.  About  15  minutes  (hanging  drop  preparation  ;  less  under 
cover  slip)  after  the  first  signs  of  change,  the  flagella  are  absorlied  by  a  pseudopodium 
which  pushes  out  close  to  tlieir  origin,  thrusts  them  aside,  and  flows  along  them  and 
engulfs  them. 

Til  is   process  may   go   some  way   and    then    suflrr   reversal.      On   one  occasion   a 
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flagellate  was  seen  to  rounil  up,  protrude  pseudopodia,  and  take  ou  typical  amoeboid 
form,  and  after  2^  minutes  gradually  to  recover  the  definite  flagellate  shape.  After 
this  it  was  watched  continuously  for  25  minutes,  and  then  showed  no  signs  of 
transition.  In  this  instance  of  the  flagellate's  attaining  the  amoeboid  form,  the 
flasella  did  not  lose  their  connection  with  the  nucleus. 


Text-fig.  1. — Transformation  of  Flagellated  into  Amffiboid  Stage  of  V.  soli. 

1,  Flagellate  stage;  2,  start  of  transformation;  3,  1  minute  after  start;  4,  2  minutes;  5,  3  minutes; 

6,  7  minutes;  7,  9  minutes;  8,  12  minutes;  9,  15  minutes;  10,  17  minutes. 

The  significance  of  these  flagellate  stages  is  at  present  unknown,  but  whether  their 

appearance  forms  grounds  for  removing  the  lima x-&mcehse  from  the  group  of  true 

Amoebae  and  placing  them  amongst  the  Proteomyxa  is  a  question  that  future  work 

must  decide. 

Amccha  nicumis  (n.  sp.). 

Amoeba  cucumis  was  also  met  with  in  the  cucumber  soil,  and  it  appeared  in  the 
cultures  of  manure-agar  plates  associated  with  Vahlkmnpjia  soli.  It  could  be  readily 
distinguished  from  the  Vahlkampfia  form  in  life  since  it  is  a  very  sluggish  form  with 
very  characteristic   pseudopodia.       Dividing  forms  were   most   frequently  found  on 
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cultures  three  to  four  days  old,  and  cysts  were  not  frequent  until  the  eighth  or 
ninth  day.  Active  forms  associated  with  cysts  have,  in  fact,  been  met  with  in 
cultures  over  a  month  old,  though  these  forms  were  then  of  rather  small  size.  In 
young  cultures  the  typical  appearance  seems  to  be  an  almost  spherical  form  with 
rather  fine  radiating  pseudopodia  composed  almost  entirely  of  ectoplasm.  Frequently, 
however,  forms  are  met  with  in  which  a  kind  of  wave  of  ectoplasm  is  thrown  out. 
As  far  as  can  be  seen  there  is  no  definite  contractile  vacuole.  This  form  is  not  nearly 
such  a  typical  devourer  of  bacteria  as  Vahlkmnpfia  soli,  since  it  not  infrequently 
ingests  small  flagellates,  cysts,  etc. 

The  cytoplasm  has  a  tendency  to  take  up  stains  rather  deeply.  The  nucleus  is 
almost  central,  it  consists  in  young  cultures  of  a  large,  darkly  staining  karyosome 
separated  by  a  considerable  space  from  a  distinct  membrane,  and  in  this  space  small 
particles,  composed  apparently  of  chromatin,  lie,  their  presence  being  more  noticeable 
in  forms  from  older  cultures.  The  division  of  this  form  seems  to  show  many  points 
of  similarity  to  that  described  by  Glaser  for  Amceba  lamellipodia. 

In  the  early  stages  of  division  the  karyosome  becomes  much  swollen  and  the 
cytoplasm  of  the  body  becomes  very  definitely  rounded  up  (Plate  6,  fig.  22).  The 
karyosome  then  becomes  broken  up  into  a  number  of  small  chromatin  masses,  and 
these  become  arranged  in  a  plate  which  may  extend  over  half  of  the  diameter  of  the 
animal  (Plate  6,  fig.  23).  The  chromatin-granules  are  at  first  ^^resent  in  a  number 
of  rows,  but  later  they  become  arranged  in  a  double  row,  thus  giving  rise  to  two 
distinct  equatorial  plates  (Plate  (5,  fig.  24).  These  two  plates,  which  lie  in  a 
difterentiated  tract  in  which  we  can  find  no  trace  of  a  definite  fibril  structure, 
become  gradually  shifted  apart  and  at  the  same  time  more  concentrated  (Plate  6, 
fig.  2()).  At  about  this  stage  the  first  signs  of  division  are  seen  in  the  cytoplasmic 
body.  The  chromatin-granules  now  become  lumped  into  masses  which  coalesce  to 
form  a  new  karyosome.  An  odd  feature  of  this  division  is  the  prominence  of  a  thin 
thread  connecting  the  daughter-individuals  (Plate  G,  fig.  28),  whicli  seems  absolutely 
identical  with  that  figured  by  Glaser  for  Amceba  lamellipodia.  In  many  cases 
encystatiou  seemed  to  be  preceded  by  an  association  of  two  individuals  correlated 
with  some  rather  complicated  nuclear  changes  ;  to  these,  however,  we  hope  to  return 
on  a  future  occasion.  The  ripe  cyst  in  life  is  a  thin-walled,  colourless  structure 
(Plate  0,  fig.  29).  Tlie  nucleus  lies  rather  eccentrically,  and  the  cytoplasm  is  at 
certain  stages  crowded  with  darkly  staining  masses  which  appear  to  be  extruded  by 
chromatin  granules. 

In  the  process  of  excystation  it  appears  that  tlie  cyst- wall  is  dissolved.  For 
exanqjle,  an  excysting  form  was  picked  up  at  3.30;  at  this  time  the  cyst-wall  was 
quite  definite  and  tlie  nucleus  rather  elongate.  Vacuoles  now  made  their  appearance 
at  the  periphery  of  the  cytoplasm  and  under  the  cyst-wall,  and  by  3.58  the  wall 
was  completely  dissolved  and  tlie  typical  membranous  pseudopodia  had  made  their 
appearance. 
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Seedling  Bed. 

Ill  March  of  this  year  one  of  us  liad  an  opportunity  of  examining,  through  the 
kindness  of  Mr.  Pitt  of  Abergavenny,  the  soil  of  a  frame  bed  in  which  seedHng 
cauliflowei's  were  being  grown.  The  bed  was  made  this  year  by  taking  the  first  spit 
of  soil  after  the  turf  had  been  removed  from  a  neighbouring  field  and  mixing  it 
with  sand  and  leaf  mould  ;  no  manure  was  added,  and  the  frame  was  not  heated,  so 
that  the  conditions  were  as  difterent  from  those  in  the  preceding  soil  as  could  be 
imagined. 

The  active  fauna  found  by  means  of  the  picric  acid  method  was  not  as  rich  in  the 
number  of  individuals  but  far  richer  in  the  number  of  species  than  the  cucumber  soil. 
This  case  seems  to  present  an  interesting  analogy  from  a  faunistic  point  of  view  to 
results  obtained  on  the  grass  plots  at  Rothamsted,  where,  as  is  well  known,  the 
untreated  plot  still  gives  a  large  number  of  species,  whereas  in  some  of  the  plots 
which  have  received  heavy  manure  of  one  kind  for  many  years  the  number  of  species 
has  been  cut  down  to  four  or  five. 

The  same  phenomenon  seems  to  be  shown  in  rich  infusions,  in  which  as  a  rule  one 
or  other  protozoon  gets  an  upper  hand,  whereas  in  ordinary  fresh-water  pools  the 
fauna,  though  far  richer  from  the  point  of  view  of  the  number  of  species  to  be  met 
with,  is  far  poorer  in  the  actual  numbers  of  individuals.  In  fresh  preparations  of  this 
soil,  of  which  only  a  few  were  carefully  examined,  the  following  were  some  of  the 
protozoa  met  with  in  a  trophic  condition  : — 

1.  Euglyplia,  ?  sp.  (Plate  5,  fig.  5).  5.  Amoeba,  ?sp.  (Plate  5,  fig.  9). 

2.  Chlamydophrys,  ?sp.  (Plate  5,  fig.  6).       6.  Amceha,  ?  sp. 

3.  Amoeba  gobanniensis,  n.  sp.  (Plate  5,         7.  Flagellate  amoeba. 

fig.  7).  8.  Bodo  caudatus  (Plate  6,  figs.  35,  36, 

4.  Amoeba,  ?  sp.  (Plate  5,  fig.  8).  and  37). 

On  cultures  from  this  soil  ximceba  diploidea,  a  monad  and  Prowazekia  sp.  ?  and 
a  small  ciliate  were  also  met  with.  It  is  probable  that  a  more  careful  search  would 
have  revealed  AmcBba  diploidea  in  a  trophic  condition  on  fresh  films  of  this  soil. 

In  this  paper  we  only  propose  to  note  some  of  the  forms  met  with  in  this  soil 
with  the  exception  of  an  amoeba.  Amoeba  gobanniensis  n.  sp.,  which  seems  of 
especial  interest  since  it  is  evidently  so  closely  allied  to  the  Amoeba  cucumis  found  in 
the  cucumber  soil. 

The  Chlamydophrys  found  in  this  soil  seems  to  be  rather  an  interesting  form  as  it 
differs  in  respect  of  its  size  and  behaviour  of  the  pseudopodia  from  the  Chlamydophrys 
which  one  of  us  cultured  out  of  a  soil  two  years  ago  and  has  kept  under  culture 
ever  since  in  the  hope  of  being  able  to  confirm  Schaudinn's  account  of  the  life-cycle 
of  this  form.  Further  details  of  this  form  will  probably  be  published  in  a  future 
paper. 
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An  amceboid  form  which  was  rather  interesting  from  the  readiness  with  which  it 
assumed  flagellate  condition  was  met  with  in  this  soil.  It  seemed  to  divide  in  the 
amoeboid  condition. 

A  relatively  large  amoeba  was  also  met  with  in  fresh  films  and  in  cultures  from 

this  soil. 

A  Bodo,  which  we  are  inclined  to  identify  with  the  Bodo  caudatus  of  Dujardin, 
was  met  with  both  on  the  fresh  smears  and  in  the  cultures.  As  will  be  remembered, 
this  species  is  characterised  by  a  peculiar  method  of  multiple  division,  and  it  is,  as  far 
as  we  are  aware,  the  only  flagellate  of  a  Bodo  type  for  which  this  method  of  division 
has  been  described  (Plate  6,  figs.  35,  36,  and  37). 

From  the  point  of  view  of  nomenclature  it  is  interesting  to  observe  that  this 
Bodo  has  a  typical  kinetonucleus,  and  this  would  seem  to  emphasise  a  point  already 
brought  out  by  Alexeieff  as  to  the  very  unsatisfactory  state  of  the  classification  of 
the  genera  Bodo  and  ProwazeUa.  Some  stages  of  the  division  of  this  form  are 
shown. 

The  behaviour  of  the  nucleus  seems  rather  complicated  and  will  be  dealt  with  more 
fully  by  one  of  us  in  a  future  paper  in  the  '  Zoologischer  Anzeiger.' 

Amoeba  gohanniensis,  n.  sp. 

This  amoeba  was  met  with  on  the  fresh  films  of  the  seedling  soil  and  on  cultures 
made  from  it  on  manure-agar  and  water. 

Amceha  gohanniensis  is  again  extraordinarily  like  the  Amoeba  lameUipodia 
described  by  Glaser(16)  and  the  previously  described  Amoeba  cucum,is,  but  here 
again  there  are  certain  distinctive  features  which  seem  to  show  the  necessity  for  the 
formation  of  a  new  species,  in  order  to  avoid  the  danger  of  including  two  different 
forms  under  the  same  specific  name. 

In  life  this  amoeba  is  a  very  sluggish  form,  and  is  readily  recognised  by  the 
extraordinary  development  of  its  ectoplasm.  The  characteristic  prickle  pseudopodia 
of  Amaiba  cucumis  (cf.  Plate  G,  fig.  21)  are  absent  in  this  form,  in  which  the  resting 
form  is  characterised  by  a  large  plate-like  ectoplasmic  pseudopodium  surrounding  the 
animal.  In  movement  a  long  axial  pseudopodium  from  the  eutoplasmic  core  seems  to 
be  thrown  out  by  the  animal. 

The  nucleus  in  the  stained  form  consists  of  rather  a  small  karyosome,  between 
which  and  the  membrane  lie  a  nuna])er  of  irregular  granules. 

The  behaviour  of  this  animal  in  division  recalls  in  many  points  the  division  of 
Amoeba  cucumis,  particularly  in  the  last  stage  of  division,  in  which  the  dividing 
pidchicts  remain  for  some  time  connected  ])y  a  narrow  thread  of  protoplasm  (Plate  6, 
fig.  34).  In  examining  a  large  number  of  division  stages  of  both  these  forms,  there 
seem  to  be  certain  constaTit  points  of  difference.  Firstly,  the  dividing  Amoeba 
gohanniensis  never  becomes  .so  spherical  as  the  Anurha  cnciimis,  and   it    is  always 
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chanicterisetl  liy  the  beliaviour  of  lliu  ectoplasm.  Secondly,  the  hand  of  chrornaliii 
granules  in  the  early  stage  of  division  is  not  nearly  so  broad  as  in  the  corresponding 
stage  of  Amceha  cucumis,  and  in  the  later  stages  the  spindle  seems  always  slightly 
smaller.  Thirdly,  there  is  a  more  definite  indication  of  fibrous  structure  in  th<; 
spindle  oi  Amoeba  gohanniensis  (Plate  G,  fig.  32). 

The  stages  leading  up  to  encystation  seem  to  show  similar  nuclear  complications  to 
those  tliat  have  been  noticed  in  Amaiha  cucumis,  and  the  cyst  seems  to  be  very 
similar. 

Conclusions. 

It  seems  generally  agreed  that  further  examination  of  the  Anaoeba3  will  necessitate 
the  splitting  up  of  the  genus  Amceha  into  a  number  of  genera.  The  first  and 
much  needed  step  in  this  reform  has  already  been  taken  by  Chatton  (13),  by  the 
formation  of  the  genus  Vahlkampfia  for  the  group  of  Umax  amosbse.  Whether 
it  would  not  be  better  to  put  this  genus  and  all  the  other  amoebse  which  show  a 
flagellate  stage  in  their  life-cycle  apart  from  the  gamete  stage  into  the  grouj) 
Proteomyxa  seems  to  us  an  open  question.  There  can  be  no  doubt  that  in  this  genus 
Vahlkampfia  a  number  of  quite  definite  species  are  included  which  can  probably  be 
best  separated  by  minute  differences  in  the  behaviour  of  the  nucleus  during  division. 
It  will  probably  be  found  necessary  in  the  same  way  to  form  another  genus  for  the 
lamellipodia  group  of  amoebas,  which  would  again  have  to  be  broken  up  into  a 
number  of  species  in  a  similar  manner. 

The  main  purpose  of  this  introductory  paper  has  not,  however,  been  the  study  of 
these  amoebae  from  a  specific  point  of  view,  so  much  as  the  proof  which  we  hope  to 
have  brought  of  the  existence  of  a  relatively  frequent  trophic  Protozoan  fauna  in 
certain  soils  and  the  rough  indication  of  some  possible  methods  of  dealing  with  this 
fauna.  How  far  this  fauna  under  certain  conditions  exercises  a  deleterious  influence 
on  plant  growth  is  rather  a  question  for  the  agriculturist  than  the  zoologist. 

The  startling  success  in  the  Lee  Valley  of  the  treatment  of  sick  soils  by  partial 
sterilisation,  introduced  by  Russell,  would  seem  to  present  a  very  strong  argument 
in  favour  of  the  view  that  these  protozoa  do  exercise  an  important  influence  on  plant 
growth  in  these  soils.  We  have,  by  means  of  the  method  described  above,  been  able 
to  establish  the  occurrence  of  a  trophic  Protozoan  fauna  in  certain  field  soils  that  we 
have  examined,  and  to  this  question  we  hope  to  return  in  a  future  paper. 
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DESCRIPTION   OF   PLATES. 

Plate  5. 

Fig.  1. — Euglypha  sp.  from  fresh  preparation  of  cucumber  bed. 

,,  2. —  Chilodon  sp.  from  fresh  preparation  of  cucumber  bed. 

,,  3. — Flagellate  from  fresh  preparation  of  cucumber  bed. 

,,  4. — Dividing  Vahlkam2)fia  soli  from  fresh  preparation  of  cucumber  bed. 

,,  5. — Euglyplia  sp.  from  fresh  jjreparation  of  seedling  bed. 

,,  G. — Chlamydoi^hrys  sp.  from  fresh  preparation  of  seedling  bed. 

,,  7. — Amceha  (johannicnsis  from  fresh  preparation  of  seedling  bed. 

,,  8. — Amceha  sp.  from  fresh  preparation  of  seedling  bed. 

,,  9. — Amoeba  sp.  from  fresh  preparation  of  seedling  bed. 

Vahlkamjijia  soli  (n.  sp.). 

(All  the  drawings  of  this  form  were  made  under  Zeiss  2  mm.  apoch.  +  18  comp.  oc.) 

Fig.  10. —  Vahlkampfia  soli  from  fresh  preparation  of  cucumber  bed. 
Figs.  11-16. — Stages  in  division  of  Vahlkampfia  soli. 
Fig.  17. — Flagellated  stage  of  Vahlkampfia  soli. 
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Plate  6. 

Vahlkam/pfia  soli  (n.  sp.). 
Fig.  18. — Trausitiou  to  amoeboid  stage  (whole  length  of  flagella  not  shown). 
,,     19. — Cyst  of  Vahlkamjjjia  soli. 

Amceha  cuctmiis  (n.  sp.). 
(Zeiss  2  mm.  apoch. +  1'-^  coinp.  oc.) 
Fig.  20. — Amcch'-t  cncnmis  from  fresh  preparation  of  encumber  bed. 

,,     21. — Amoeba  cucumis  from  young  culture. 
Figs.  22-28. — Stages  in  division  of  Amceha  cucumis. 
Fig.  29. — Cyst  of  Amoeba  cucumis. 

Amoeba  gohanniensis  (n.  sp.). 

(Zeiss  2  mm.  apoch.  +  12  comp.  oc.) 

Fig.  30. — Amoeba  gobanniensis  from  culture. 

Figs.  31-34. — Stages  \n  division  o?  Am,ceha  gobanniensis. 

Bodo  caudatus. 
(Zeiss  I'D  mm.  apoch.  +  18  comp.  oc.) 
Fig.  35. — Bodo  caudatus. 
,,     3G. — Multiple  division  of  Bodo  caudat/us. 
,,     37. — -Multiple  division  of  Bodo  caudatus,  later  stage. 
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1.  Introduction. 

Botrytis  cinerea  is  perhaps  the  commouest  and  best  known  fungus,  and  has  been  a 
centre  of  mycological  research  since  the  time  of  DE  Bary.  Few,  if  any  other  fungi, 
have  been  studied  so  thoroughly  by  so  naany  able  investigators,  or  are  the  subject  of 
so  extensive  a  literature ;  and  one  need  only  draw  attention  to  the  researches  of 
DE  Bary  (6),  Pirotta  (61),  Marshall  Ward  (80),  Kissling  (45),  Nordhausen(58), 
R.  E.  Smith  (70),  Farneti  (27),  Beauverie  and  Guilliermond  (9,  10),  Istvanfei  (33), 
Reidemeister  (65),  and  the  more  recent  researches  from  the  laboratory  of  Prof.  V.  H. 
Blackman  (13,  16,  17,  84).  There  is  a  body  of  experimentally  ascertained  and 
exact  knowledge  concerning  the  bionomics  of  this  fungus,  which  can  be  exceeded  by 
that  of  few  other  micro-organisms. 

For  the  particular  purposes  in  view  in  my  investigations,  it  has  been  imperative 
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that  all  experimental  work  be  carried  out  with  pedigree  cultures.  It  was  very  early 
discovered  that  the  ordinary  "  pure-culture "  of  mycological  and  bacteriological 
laboratories  may  be  and  very  often  is  a  poly-genotypic  population,  and  that  very 
considerable  genotypic  differences  may  usually  be  found  in  apparently  homogeneous 
populations  exhibiting  only  one  single  type,  around  which  the  individuals  fluctuate. 
In  order  to  eliminate  this  vitiating  factor,  single-spore  cultures  were  prepared  either 
by  BuRRi's  (19)  Indian-ink  method,  plate  isolation,  or,  more  usually,  by  a  combination 
of  the  plate  and  dilution-drop  method.  These  cultures  served  as  the  initial  source  of 
experimental  material.  In  all,  over  seventy  such  pedigree  stocks  have  been  prepared 
from  original  sub-strata,  representing  about  sixty  different  species  of  host  plant, 
derived  from  all  parts  of  the  country.  From  these  stocks  some  fifteen  thousand 
cultures  have  been  made.  During  the  course  of  the  investigation,  many  of  these 
have  been  subjected  to  the  most  diverse  environmental  conditions,  and  all  have  been 
maintained  under  the  closest  scrutiny.  With  the  exception  of  the  single  culture  to 
be  described,  no  change  that  could  be  interpreted  as  a  permanent  heritable  alteration 
or  mutation  has  been  observed. 

2.   Life  Cycle  of  Botrytis  cinerea. 

In  nature,  the  fungus  usually  appears  as  a  delicate  smoke-grey  velvety  pile, 
covering  diseased  plant  tissues  or  organic  debris,  or,  oil  more  resistant  sub-strata,  in 
the  form  of  isolated  grey  pustules.  This  superficial  growth  consists  of  the  branched 
couidiophores  bearing  spores.  On  germination,  which  under  favourable  conditions 
occurs  in  a  few  hours,  each  of  the  latter  gives  rise  to  a  mycelium,  which  in  two  or 
three  days  produces  a  further  crop  of  conidiopliores  and  spores.  When  old,  the 
h yplise  and  spores  may  produce  mlcroconidia,  which  germinate  directly  giving  rise  to 
mycelium.  If  at  any  stage  of  development  conditions  unfavourable  to  sporogeny 
intervene,  the  mycelium  produces  sclerotia,  and  these,  under  more  favourable 
conditions,  give  rise  to  tufts  of  conidioj^hores  bearing  spores.  Under  certain 
adverse  conditions,  the  hyphse  may  produce  chlamydospores,  or  themselves  break  up 
into  oidia,  and  these,  on  germination,  give  rise  to  normal  mycelium. 

It  was  tlie  opinion  of  UE  Bary  (5,  6),  Viala  (79),  Zopf  (89),  and  other  early 
investigators,  an  opinion  based  purely  on  -superficial  resemblances  and  the  frequent 
contiguity  of  growth  of  the  organisms,  that  Botrytis  cinerea  is  only  the  conidial 
phase  of  a  Discomycetous  fungus,  which  was  referred  to  either  Sclerotinia 
Fuckeliana  or  S.  lihertiana.  In  the  absence  of  any  proof  of  this  connection,  and 
despite  their  own  negative  experience,  this  opinion  was  accepted  by  Marshall 
Ward  (80),  Kisslinu  (45),  and  others,  and  so  became  an  integral  part  of  myco- 
logical literature.  In  1905,  IsTVANFFI  (32)  published  a  voluminous  memoir, 
purporting  to  bring  forward  proof  of  the  genetic  relationship  of  Botrytis  cinerea 
and  Sclerotinia  Fuckeliana.  In  spite  of  the  prior  and  much  more  incisive  work  of 
R.   E.    Smith  (20),   this  evidence  has  been  generally  accepted.     It  is,  however,  of 
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doubtful  value,  for  a  critical  examination  of  the  investigation  makes  clear  that 
IsTVANFFi  has  confused  two  distinct  fungi,  which  possess  certain  superficial 
resemblances,  often  grow  together  on  the  same  host,  and  are  not  easily  separable 
in  culture. 

Recently,  Seaver  and  HoRNE  (68)  claim  to  have  established  the  relationship  of  an 
unnamed  species  of  Botrytis  with  a  new  species  of  Sdcrotmia,  which  they  have 
termed  S.  Geranii.  The  evidence  is  very  brief,  and  awaits  confirmation,  and,  in 
view  of  the  fact  that  many  "pure-cultures"  of  fungi  are  undoubtedly  mixed 
populations,  that  Botrytis  cinerea  and  various  species  of  Sclerotinia  and  other 
genera  harmonise  perfectly  in  their  growth  when  developing  intermixed,  both  on 
natural  hosts  and  on  many  artificial  media,  and  that,  unless  specially  searched  for, 
the  very  minute  microconidia  of  both  forms  are  easily  overlooked,  and  may  act  as  a 
contaminating  factor,  one  may  perhaps  hesitate  to  accept  unreservedly  this  evidence. 
Furthermore,  in  my  own  study,  some  fifteen  thousand  cultures  of  various  strains  of 
Botrytis  cinerea  and  nearly  related  "  species  "  have  been  closely  observed  under  the 
most  varied  environmental  conditions,  and  much  experimental  work  has  been  specially 
,  directed  towards  the  elucidation  of  the  genetic  relationships  of  the  fungus.  I  have, 
however,  found  no  evidence  indicating  that  Botrytis  cinerea  is  in  any  way  a 
developmental  phase  within  the  life-cycle  of  Sclerotinia  Fuckcliana  or  genetically 
related  to  any  other  species  of  this  genus.  Repeatedly,  strains  of  Botrytis  cinerea 
and  various  species  of  Sclerotinia  have  been  grown  side  by  side  by  inoculating 
alternate  quadrants  of  a  plate  culture,  and,  under  all  conditions,  have  remained 
separate.  My  experience  merely  confirms  that  of  LiND  (47),  R.  E.  Smith  (70), 
Peltier  (59),  Pethybridge  (60),  and  others,  who  have  paid  special  attention  to  this 
aspect  of  the  problem. 

Botrytis  cinerea  as  a  discrete  entity  is  an  asexual  fungus,  and  the  critical  import- 
ance of  a  Sclerotinial  relationship  lies  in  the  possibility  which  this  introduces  of  a 
sexual  process.  If  the  organism  be  sexual,  any  single  individual  may  possibly  be 
heterozygous,  and  there  is  then  no  inherent  improbability  that  segregation  may  occur, 
resulting  in  the  appearance  of  apparently  new  forms  which  might  mistakenly  be 
interpreted  as  mutants. 

In  many  Mucorinece,  ChytridinecB,  Saccharomyces  and  possibly  other  Ascomycetes 
the  sexual  process  is  allogaraous  and  factorial  segregation  is  not  impossible.  In  the 
great  majority  of  fungi,  however,  in  which  sexuality  occurs,  and  here  is  included  so 
far  as  is  known  the  entire  group  DiscomycetinecB,  which  contains  the  genus  Sclerotinia, 
the  process  is  autogamous.  Now  it  is  well  known,  and  has  recently  been  mathe- 
matically demonstrated  by  Jennings  (35),  that  even  in  an  originally  heterozygous 
organism,  self-fertilisation  if  continued  generation  after  generation  leads  rapidly  to  a 
condition  in  which  the  offspring  are  homozygous.  In  such  case  Johannsen  (36,  37) 
has  shown  that  the  mere  isolation  of  the  progeny  from  a  single  reproductive  member 
produces  a  pure  line. 
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Even,  therefore,  in  the  remote  contingency  that  Botrytis  cinerea  is  merely  a 
developmental  phase  of  Sclerotinia  Fuckeliana  and    that    this   possesses   a   sexual 

process which  has  yet  to  be  demonstrated  for  any  species  of  this  genus— it  would 

appear  that  the  complications  resulting  from  heterozygosis  are  absent  from  our 
problem.  For  critical  purposes  Botrytis  cinerea  is,  on  all  evidential  criteria,  an 
asexual  homozygotic  organism  in  which  the  isolation  of  a  single  spore  strain  necessarily 
implies   the  isolation   of  a   "  pure-line."     A   genotypic   change   in   a  pure   line    is    a 

mutation. 

3.  Origin  of  Colourless  Form.     General. 

On  July  10,  1917,  a  diseased  specimen  of  Grassula  jierforata  from  the  succulent 
house  of  the  Royal  Botanic  Gardens,  Kew,  was  examined,  and  the  causal  pathogen 
identified  as  Botrytis  cinerea.  The  growth  was  normal  in  every  respect,  and  from  it 
six  cultures,  (A.  1-6),  were  prepared.  These  appeared  to  be  free  from  contamination  by 
other  organisms,  and  on  July  16,  six  further  cultures,  (B.  1-6),  were  taken  from  (A.  6). 
On  July  22  all  twelve  were  carefully  examined  and  found  to  be  free  from  contamina- 
tion. A  series  of  dilution  drops  were  then  prepared  from  (B.  6),  many  of  these  drops 
containing  but  a  single  spore.  The  latter  were  transferred  to  a  thin  plate  of  potato- 
agar  and  their  development  scrutinised  carefully  during  two  days.  A  colony  whose 
origin  from  a  single  spore  had  been  ascertained  with  certainty  was  then  picked  up  on 
a  platinum  wire  and  planted  in  a  tube  slant.  The  spores  are  comparatively  large,  the 
most  common  size  on  the  host  plant  being  7 '5  )U,X9'5  fi  ;  they  are  thus  easily  seen 
and  isolated.  On  germination  they  become  still  more  obvious,  and  the  obtaining  of  a 
single-spore  strain  is  thus  a  simple  procedure.  On  August  10  twelve  cultures 
(D.  1-12),  were  prepared  from  the  pedigree  stock  (C.  1),  and  on  August  23  six  further 
cultures,  (E.  1-6)  were  made  from  (D.  12).  So  far  the  work  had  merely  been  a  part 
of  the  ordinary  laboratory  routine,  the  cultures  which  were  all  on  tube  slants  of 
potato-agar  being  used  for  various  experimental  purposes. 

About  a  week  later  cultures  (E.  1-6)  were  examined  and  some  three  or  four  days 
later  ri'-examined.  The  six  cultures  had  all  been  placed  together  in  a  wire  basket 
and  left  on  the  laboratory  bench.  It  was  noted  that  in  tube  (E.  6)  one  of  the  sclerotia 
had  not  turned  black  like  the  others  but  had  remained  colourless.  In  certain  strains 
of  Botrytis  cinerea  these  bodies  do  not  form  the  black  pigment  characteristic  of  the 
genus  until  they  are  mature,  and  although  this  particular  sclerotium  differed  from  its 
fellows  in  apparently  following  this  mode  of  behaviour,  it  was  merely  regarded  as  a 
somewhat  youthful  individual,  perhaps  a  little  unusual,  but  not  particularly  note- 
worthy. On  September  10,  however,  when  tubes  (E.  1-6)  were  again  examined  and 
the  aberrant  sclerotium  in  (E.  6),  now  quite  mature,  was  found  to  be  still  lacking  any 
sign  of  pigmentation,  it  was  decided  to  isolate  this  sclerotium  and  breed  from  it.  In 
tube  (E.  6)  there  were  in  addition  to  the  colourless  sclerotium  24  normal  black 
sclerotia  and  a  black  sclerotial  crust  around  the  bottom  of  the  tube,  where  the  medium 
had  contracted  away  from  the  glass. 
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The  colourless  sclerotium  was  removed  and  thoroughly  washed  in  sterile  distilled 
water  to  remove  adherent  spores.  It  was  then  broken  open  and  a  fragment  from  the 
centre  taken  out,  placed  on  a  tube-slant  qf  potato-agar  (F.I.  6)  and  incubated  at 
22°  C.  A  vigorous  growth  ensued  and  a  week  later  sclerotia  began  to  be  formed. 
In  the  tube  there  were  ultimately  formed  17  sclerotia  and  all  were  colourless. 
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Crassula    perforata 


•     •     •     •     •    • 

12         5        15        6 

•        •        • 

1 
•       •       • 

D 


10 

_L 


E 


G 


04- 


July  lO^l- 1917. 


.  July  lOL^  1917. 


5 

I 


July  16'!' 1917. 


-.•^'t'Jr^yj^iy^^'-'i^iy- 


i    I    I    I    I    I    I    1    I    I    M 
•   ••••••••••• 


•  ©. 

5        6 

_L_ 


-/«.• 


-ICL* 


-lai 


Ib.O 


lb.O 


tb.O 


-  August  10 tl' 1917 

-  August  23'^  1917. 
September  IOi'>l9IZ 


}    190  Generations. 


In  a  normal  sclerotium  the  pigmentation  is  confined  to  the  outer  one  to  three 
layers  of  cells  (text-fig.  1)  and  as  it  is  from  these  that  the  conidiophores  arise  on 
germination,  it  was  considered  not  impossible  that  these  cells  and  their  derivatives 
only  might  carry  the  power  of  colour  formation.  If,  therefore,  a  fragment  of  the 
inner  hyaline  tissues  of  such  a  normal  sclerotium  were  planted  on  a  nutrient  medium 
it  seemed  possible  that  colourless  sclerotia  similar  to  those  obtained  in  tube  (F.I.  h) 
might  develop.  Accordingly,  a  black  sclerotium  from  {\i.  6)  was  treated  thus — 
exactly  as  the  aberrant  sclerotium  had  been  treated.  The  subsequent  culture  (F.I.  a) 
contained  uniformly  black  sclerotia. 

There  were  now  in  existence  two  distinct  strains  (F.I.  a)  and  (F.I.  h)  of  the  fungus 
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both  derived  from  a  single-spore  stock  (C^.  I).  Of  these  the  parental  form  (F.I.  r^) 
possessed  the  highly  characteristic  black  sclerotia;  whilst  the  new  strain  (F.I.  6) 
which  had  arisen  from  this  apparently  by  a  single  direct  saltation,  possessed  colourless 
sclerotia.  The  history  of  the  two  forms  is  shown  in  a  diagrammatic  manner  in 
Table  I.  The  distinguishing  lettering  (A.  1-6)  and  so  forth,  and  the  arrangement  of 
descent  in  this  Table  have  been  adopted  for  the  sake  of  clearness  in  the  account.  In 
practice  (D.  1-12)  were  merely  12  similar  cultures  taken  from  (C.  1),  and  (E.  6)  was 
taken  from  one  of  these  cultures  chosen  at  random.  In  the  same  way  six  cultures 
were  made  on  July  16  and  from  one  of  these  chosen  at  random  (C.  1)  was  prepared. 

It  seemed  possible  that  an  albinistic  tendency  might  be  a  "  weakness  "  inherent  in  this 
particular  race  of  Botrytis  cinerea,  and  that  if  the  preceding  generations  were  further 
subcultured  colourless  sclerotia  might  again  appear.  Unfortunately,  the  onguial 
diseased  plant  and  the  first  six  cultures  made  from  it  had  been  destroyed,  but  all 
subsequent  cultures  had  been  retained.  As  no  particular  care  had  been  taken  of 
these  some  had  become  contaminated.  In  all  cases,  however,  it  was  possible  to  sub- 
culture purely  from  them  and  this  was  done  extensively  for  many  generations  under  the 
most  diverse  conditions  of  light,  temperature,  food  supply  and  so  forth,  all  the  result- 
ing cultures  being  most  carefully  examined  for  any  indication  of  a  lack  of  sclerotial 
pigmentation.     All  were  perfectly  normal. 

The  parental  strain  and  its  colourless  derivative  have  each,  up  to  the  time  of 
writing,  passed  through  over  two  hundred  direct  tube  generations  with  very  many 
lateral  subcultures,  and  although  these  have  been  placed  under  diverse  environmental 
conditions  each  strain  has  remained  absolutely  constant. 

Since  this  colourless  strain  arose,  many  thousands  of  cultures  and  fresh  specimens 
of  Botrytis  cinerea  have  passed  under  my  observation,  but  in  no  single  instance  have 
I  found  any  similar  absence  of  sclerotial  pigmentation.  Prof.  H.  H.  Whetzel,  of 
Cornell  University,  who  for  some  years  has  been  engaged  upon  a  monograph  of  the 
genus  Botrytis  and  has  received  and  compared  material  from  many  parts  of  the  world, 
informs  me,  after  having  examined  a  culture  of  the  colourless  strain,  that  he  has 
"  never  seen  anything  quite  like  it."  In  the  voluminous  literature  devoted  to  the 
funsus  there  is  no  record  of  a  form  with  colourless  sclerotia. 

Having  regard,  therefore,  to  our  knowledge  of  the  fungus  Botrytis  cinerea  and  to 
the  particular  conditions  under  which  the  aberrant  strain  originated,  it  would  appear 
difficult  on  present  criteria  to  interpret  it  as  other  than  a  true  mutation. 

4.    Comparison  of  Parental  Strain  and  its  Colourless  Derivative. 

The  obvious  difference  between  the  original  and  the  derived  forms  lies  in  the 
colour  of  the  sclerotia.  Mutations  in  the  fungi  Aspergillus  niger  and  Penicillium 
glaucum  have  been  described  at  length  by  Akoichovsku  (2),  Waterman  (83)  and 
ScHiEMANN  (6iJ).      In  these  cases,  however,  although  the  obvious  change  was  in  sjDoi'e 
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colouration,  there  were  other  less  visible  changes  in  several  independent  respects, 
certain  morphological  characters  and  physiological  activities  of  the  derived  fungi 
being  different  from  those  of  the  original  forms.  In  addition  to  these  facts,  the 
results  of  recent  researches  on  the  linkage  of  characters  and  the  multiple  allelomorph 
interpretation  of  quantitative  inheritance  made  it  appear  probable  that  in  the  present 
case  the  colour  cliange  might  be  oiJy  one  of  many  related  changes  less  obviously 
visible. 

Moreover,  whether  one  regards  the  black  sclerotial  j^igment  as  a  mere  excreted 
product  or  as  an  integral  and  actively  functioning  substan"ce  in  the  metabolic 
activities  of  the  organism,  the  pigment  itself  is  the  result  of  a  long  and  elaborate 
series  of  causally  dependent  processes,  and  a  sudden  and  permanent  loss  of  the  power 
to  excrete  or  form  such  matter  can  only  be  the  visible  expression  of  deep-seated 
physiological  changes  in  the  developing  organism.  But  a  living  organism  is  an 
extremely  complex  and  delicate  equilibration  of  rhythmic  metabolic  processes  in  a 
colloid  substratum,  and  it  is  difficult  to  conceive  of  any  sudden  dei-aiigement  of  these 
activities  which  has  not  reverberations  in  many  directions. 

It  seemed,  therefore,  very  desirable  to  make  some  comparison  of  the  parental 
strain  and  its  colourless  derivative  to  ascertain  the  nature  of  any  changes  other  than 
pigmentation  which  might  be  present.  This  comparison  proceeded  along  several  lines, 
the  main  directions  of  which  are  indicated  briefly  below. 

A.  Physiological  Activities. 

If,  as  evidence  would  seem  to  show,  the  physiological  reaction  of  a  particular 
organism  is  constant  only  in  an  unvarying  environment,  it  follows  that  identity  of 
response  to  like  stimuli  implies  identity  of  constitution,  and,  conversely,  that  a 
change  in  physiological  constitution  will  be  reflected  in  a  changed  reaction  to 
unaltered  conditions.  Only  if  this  is  true  may  a  comparison  of  the  physiological 
activities  of  two  organisms  be  instituted  ;  and  being  true,  it  will  aftbrd  the  most 
delicate  test  of  individual  or  genotypic  identity,  for  the  technique  is  quantitative  and 
the  nicest  differences  may  be  measured. 

(a)  Germination  of  Spores. — The  effect  of  the  character  of  the  nutrient  medium 
upon  the  germination  of  fungus  spores. is  often  very  marked,  and  not  infrequently 
this  relation  may  be  used  to  differentiate  two  or  more  genotypes  otherwise  difficult  to 
distinguish.  The  percentage  germination  within  certain  periods  of  time  when  the 
spores  were  immersed  in  distilled  water,  15  per  cent,  gelatine  in  water,  and  CooNS' 
solution  (21)  were  compared  in  the  two  experimental  forms  and  in  two  strains  used  as 
a  control.  The  spores  were  taken  from  three-day  cultures  on  potato-agar  incubated 
at  22°  C,  and  the  hanging  drops  in  which  the  germinations  were  tested  were  main- 
tained in  the  cool  incubator  at  18°  C.  The  results  are  shown  below,  the  numbers 
representing  percentage  germinations. 
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Medium. 

Parental  Strain. 

Colourless  Strain. 

Control  Strain  A. 

Control  Strain  B. 

6 
hours. 

12 
hours. 

24 
hours. 

6 
hours. 

12 
hours. 

24 
hours. 

6 
hours. 

12 
hours. 

24 
hours. 

6 
hours. 

12        24 
hours,  hours. 

Distilled  water 

10 

27 

55 

12 

25 

53 

33 

51 

68 

25 

55 

93 

15    per    cent. 

gelatine 
Coons'  solution 

43 

67 

93 

41 

66 

90 

47 

78 

90 

47 

72 

90 

63 

83 

97 

61 

80 

96 

63 

90 

98 

61 

72        98 

The  lethal  temperature  of  the  spores  was  tested  by  making  spore  suspensions  in 
15  per  cent,  gelatine  in  distilled  water  and  heating  corresponding  tubes  of  parental, 
derived,  and  control  strains  together  in  a  water  bath  for  ten  minutes,  the  temperature 
rising  by  increments  of  five  degrees.  The  experimental  strains  both  gave  growth  in 
tubes  heated  to  45°  C,  but  not  in  those  heated  to  50"  C,  whilst  in  the  control  tubes 
growth  was  inhibited  by  temperatures  of  45°  C.  and  55°  C.  respectively. 

(b)  Growth  on  Nutrient  Media. — -One  of  the  most  critical  and  easily  determined 
measures  of  the  identity  of  two  fungi  is  their  growth  and  behaviour  upon  various 
standardised  nutrient  media  under  controlled  conditions.  The  general  characters  of 
the  colony,  the  colour  reactions  of  hyphaj  and  medium,  and  the  many  other  detailed 
phenomena  which  may  be  observed  are  valuable  diagnostic  characters,  and  sharply 
reflect  the  physiological  differences  separating  two  genotypes.  At  the  end  of  his 
paper  on  "Cultural  Studies  of  Species  of  Penicilliimi,"  Thom  (77)  has  drawn  up  a 
scheme  of  comparative  cultural  data,  which,  in  the  present  absence  of  any  more 
detailed  formulation,  could  with  valuable  results  be  adopted  by  mycologists  as  a  basis 
of  systematic  diagnosis.  In  the  comparative  examination  of  the  experimental  strains 
this  technique  was  followed,  and  the  results  compared  with  those  of  two  control 
strains.  No  essential  difference  could  be  detected  between  the  former,  but  these 
stood  in  marked  contrast  with  the  latter.  For  laboratory  purposes  other  than  this 
particular  investigation,  the  parental  strain  and  its  colourless  derivative  wei-e  largely 
used  in  experiment,  so  that  the  cultural  comparison  of  the  two  forms  was  amplified 
in  very  many  ways.  In  all  cases  where  they  were  grown  under  parallel  conditions 
no  essential  difference  could  be  detected.  The  comparative  development  of  the  two 
strains  and  control  cultures  was  tested  further  at  various  temperatures,  and  with 
different  intensities  of  light  and  conditions  of  aeration,  but  the  results  obtained  only 
contirmed  those  referred  to  above. 

A  few  qualitative  experiments  were  carried  out  to  test  the  comparative  enzyme 
production  of  the  fungi,  the  methods  used  being  adapted  from  those  described  by 
CiiABiLL  and  Reed  (22)  in  1915.  On  starch-agar  the  experimental  strain  and  one 
control  showed  approximately  equal  amylolytic  action  under  the  colonies,  while  the 
second  control  showed  a  distinct  halo  indicating  the  presence  of  a  diffusible  extra- 
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cellular  amylase.  These  differences  were  thrown  up  more  clearly  when  the  plates 
were  flooded  with  iodine  solution.  On  litmus-cream-agar  growth  of  the  experi- 
mental strains  was  profuse,  and  there  was  strong  acid  production,  the  red  colour 
diffusing  through  the  medium.  In  both  control  strains  the  lipolytic  action  was 
equally  marked  but  the  redness  much  more  sharply  defined.  All  the  four  strains 
liquefy  gelatine  and  slowly  dissolve  fibrin  in  fibrin-agar.  In  both  experimental 
strains  the  lipase  production  is  feeble  and  only  slight  browning  of  the  fibrin  occurred, 
this  being  markedly  different  from  the  results  in  the  controls.  On  casein-agar 
moderate  growth  occurred,  and  ereptic  action  was  prominent,  extending  in  a  distinct 
band  around  the  colonies.  The  production  of  erepsin  was  distinctly  greater  in  both 
controls  than  in  the  experimental  strains.  Sparse  development  occurred  on  asparagin- 
rosolic-acid-agar  with  slight  amidase  production.  The  red  colour  in  the  experimental 
strains  was  less  diffuse  than  in  the  controls. 

Cytase  formation  was  tested  by  growing  the  organisms  in  Dox's  solution  (23)  the 
carbon  source  being  filter  paper.  Moderate  growth  occurred,  but  after  a  fortnight 
there  was  only  a  barely  perceptible  reduction  with  Fehling's  solution  in  the  experi- 
mental strains  and  one  control  strain,  whilst  the  second  control  showed  more  distinct 
cytolytic  action.  Calcium-carbonate-agar  prepared  with  Dox's  solution  (23)  plus 
one  per  cent,  of  lactose  showed  a  feeble  though  distinct  production  of  lactic  acid, 
this  being  more  marked  in  the  experimental  than  in  the  control  strains.  In  all  the 
experiments  no  essential  differences  in  the  enzymic  activities  of  the  normal  strain  and 
its  aberrant  form  could  be  detected. 

(c)  Patliogenicity. — Marked  and  constant  differences  in  virulence  for  different 
hosts  are  shown  by  the  various  strains  of  Botrytis  cinerea.  In  testing  the  com- 
parative pathogenicity  of  the  experimental  strains,  the  spores,  together  with  a 
fragment  of  potato-agar,  were  planted  on  corresponding  surfaces  such  as  the  opposite 
sides  of  a  fruit,  or  opposite  halves  of  a  leaf.  On  banana,  apple,  and  tomato  fruits, 
lily  and  potato  leaves,  tulip  bulbs,  and  twigs  of  horse-chestnut  infection  readily 
occurred,  and  vigorous  development  ensued.  On  onion-bulb  scales,  potato  tubers, 
crab-apple,  cucumber,  and  fruits  of  Pyrus  japonica,  leaves  of  Primula  sinensis  and 
wall-flower,  infection  occurred  with  very  considerable  difiiculty  and  only  when  the 
tissues  were  budly  bruised,  and  the  growth  tended  to  die  out  rapidly.  Bulbs  of 
snowdrop  and  leaves  of  Portuguese  laurel  and  rhododendron  could  not  be  infected. 
In  no  case  was  any  essential  difference  in  pathogenicity  noted  between  the  parental 
and  colourless  strains.  Markedly  different  results  were  obtained  with  the  two 
control  strains,  of  which  one  was  originally  derived  from  onion  bulbs  and  the  other 
from  fruits  of  Pyrus  japonica. 

The  cultural  examination  of  the  parental  strain  and  its  colourless  derivative, 
although  put  separately  above,  was  only  part  of  a  much  larger  series  of  experiments 
which  were  being  carried  out  in  the  laboratory  to  ascertain  the  physiological 
differences  of  closely  similar  morphological  strains,  and  the  identity  of  result  obtained 

VOL.  ccx. — B.  N 


92  ME.  W.  B.  BRIEELEY   ON   A  FORM   UF 

in  this  comparison  very  largely  takes  its  critical  value  from  its  relation  to  this  wider 
issue.  Briefly,  it  may  be  stated  that  so  far  as  the  physiological  constitutions  of  the 
parental  strain  and  the  colourless  strain  may  be  judged  from  their  behaviour,  the  two 
strains  are  identical. 

B.  Morphological  Characters. 

Phenotypic  plasticity  is  shown  by  most  fungi,  and  is  not  only  evident  in  those 
general  characters  which  together  constitute  the  morphological  facies  of  the  organism, 
but  in  those  critical  structures,  the  reproductive  bodies,  whose  constancy  has  so  often 
been  assumed. 

(d)  Dimensions  of  /S^wres.-^During  recent  years  it  has  been  shown  in  a  general 
way  by  many  students  of  the  fungi,  notably  perhaps  by  Stevens  and  Hall  (76), 
and  following  their  investigations  by  Elliott  (26),  Moreau  (53),  Mutto  and 
POLLACCI  (57),  Gaumann(30,  31),  and  others,  that  the  size  of  fungus  spores  is  a 
function  of  the  particular  organism  and  the  environmental  conditions.  With  regard 
to  the  present  fungus,  Botrytis  cinerea,  undei-  certain  standardised  conditions,  the 
spore  dimensions  are  a  function  of  the  particular  pure  line  and  the  quality  of  the 
nutrient  substratum  ;  and  for  any  particular  pure  line,  different  standardised  food- 
media  give  different  but  mathematically  constant  quantities  for  the  modal  values  of 
the  variation  curves  of  the  spores.  On  a  series  of  standardised  media  under  constant 
conditions,  therefore,  the  modal  values  of  the  spores  of  any  pure  line  of  the  fungus 
may  be  represented  by  a  curve  which  is  constant  and  characteristic  for  that  pure  line. 
The  critical  quantity  is  not  the  extremes  of  variation,  but  the  curve  of  modal  values  ; 
and,  other  factors  not  varying,  a  diflerence  of  genotype  is  immediately  reflected  in 
this  curve  of  modal  values. 

The  modal  values  of  the  spores  of  the  parental  strain  and  its  colourless  derivative 
were  compared  in  this  way,  the  number  of  measurements  for  each  particular  test 
being  500.     The  results  of  three  such  comparisons  are  shown  on  p.  93. 

The  conidiopliores  of  the  fungus  are  comparatively  large  and  complexly  branched 
structui-es,  and  the  mechanical  difliculties  in  the  way  of  their  minute  comparison  are 
such  as  to  render  it  difficult  to  eliminate  errors  of  selection.  In  so  far,  however,  as 
these  structures  could  be  compared  with  regard  to  the  length,  diameter,  manner  of 
branching  and  abstriction  of  spores  no  differences  could  be  distinguished  between 
the  normal  strain  and  its  aberrant  form  ;  and  this  holds  true  also  for  the  colour  of 
the  spore  mass. 

(e)  Hapteixi. — One  oi'  the  most  interesting  structural  features  of  Botrytis  cinerea 
are  the  organs  of  attachment,  which  were  described  by  DE  Bary(5,.  6)  under  the 
name  "  Haft-organen,"  and  are  variously  termed  "  haptera"  or  "  appressoria." 

Their  presence  or  absence  in  a  culture  is  determined  by  environmental  conditions, 
and  the  controlling  factors  are  different  for  the  several  strains  of  the  fungus.  The 
experimental  strains  behaved  similarly  in   this   respect,  forming  few  haptera  when 
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incubated  at  22°  C.  on  Czapek's  medium  lacking  carbon,  producing  them  in 
abundance  when  M/10  glucose  is  added  to  this  medium,  and  suppressing  their 
formation  when  M/1  glucose  is  added.  A  similar  course  of  behaviour  is  evident 
when  the  substratum  remains  constant  and  the  temperature  varies.  The  particular 
relations  between  hapteral  formation  and  controllable  external  factors  are  very 
individual  for  the  several  strains  of  Botrytis  clnerea,  and  constitute  a  diagnostic 
feature.  The  identity  of  behaviour  between  the  normal  strain  and  the  colourless 
strain  is  therefore  noteworthy. 

(f)  Sclerotia. — The  type  of  correlation  between  hapteral  formation  and  environ- 
ment which  has  been  indicated  above  holds  true  in  the  formation  of  sclerotia,  and  the 
presence  or  absence  of  these  in  any  particular  strain  may  be  determined  at  will.  It 
may  briefly  be  stated  that  under  all  circumstances  the  modes  of  behaviour  of  the  two 
strains  with  respect  to  this  structure  were  alike. 

Owing  to  the  widely  held  belief  regarding  the  constitutional  weakness  of  albinos,  as 
compared  with  the  normal  type,  it  was  considered  not  improbable  that  the  colourless 
sclerotia  might  show  a  lack  of  vitality  and  power  to  withstand  adverse  conditions. 
No  indication  of  such  weakness  had  appeared  in  the  general  cultural  comparison  of  the 
two  strains,  but  it  seemed  desirable  to  test  the  sclerotia  themselves,  for  it  is  a  common 
assumption  that  dark  coloured  or  "carbonised"  hyphae  are  more  resistant  than 
hyaline  hyphse.  The  comparison  was  made  by  testing  the  relative  germinative 
capacity  of  selected  black  and  colourless  sclerotia  after  subjection  to  various  treat- 
ments. Parallel  series  exposed  to  extremes  of  temperature  and  desiccation  for 
various  periods  exhibited  no  difierential  resistance,  nor  was  this  found  on  subjecting 
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the  sclerotia  to  saturated  atmospheres  of  chloroform  and  ether,  or  various  dikitions 
of  toxic  substances  such  as  mercury  bichloride,  copper  sulphate,  carbolic  acid,  and 
picric  acid.      Other  strains  used  as  controls  gave  markedly  different  results. 

In  structure  the  sclerotia  vary  according  to  the  conditions  under  which  they  are 
formed.  In  contact  with  a  hard  surface  they  show  an  outer  compact  zone  two  or 
three  cells  in  thickness  which  merges  into  an  inner  zone  consisting  of  a  firm  colour- 
less tissue-mass  formed  by  closely  interweaving  hyphoe  (text-fig.  1).  On  a  soft 
matrix  a  third  central  zone  of  loose  spongy  hypha?  is  usually  present.  The  latter 
type    of  sclerotia  are  more  or  less  spherical  in   shape   whilst  the  former  often  fuse 


Text-fio.  1. 

1.  Vertical  sections  of  colourless  and  normal  sclerotia 

2.  Vertical  section  of  colourless  sclerotium 

3.  Vertical  section  of  normal  sclerotium. 


■  I  Swift 


Diagrammatic, 
obj.  No.  4  eyepiece. 


together  into  a  thin  concave  crust.  The  contact  surfaces  of  both  forms  are  colourless, 
but  this  is  merely  due  to  lack  of  aeration,  for  if  the  sclerotia  be  removed  so  that  all 
surfaces  have  access  to  air,  the  previously  colourless  contact  surfaces  develop  the 
tissue  containing  the  characteristic  pigment.  If,  however,  the  sclerotia  be  cut 
across  so  that  the  inner  tissue  zones  are  exposed,  these  do  not  develop  pigment. 
The  power  of  colour  formation  is  therefore  confined  to  the  peripheral  cells.  Save 
for  the  absence  of  pigment  in  these  cells,  the  most  minute  examination  of  the  structure 
and  form  of  the  sclerotia  of  the  parental  strain  and  its  aberrant  form  failed  to  reveal 
any  difference. 

Like  the  physiological  examination  the  comparison  of  the  morphological  characters 
of  the  two  forms  under  discussion  has  greatly  increased  value  when  it  is  realised  that 
this  work  was  only  part  of  a  wider  series  of  experimental  researches,  which 
functioned  as  controls,  and  the   results  of  which  were  invaluable  in  aiding  in  the 
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interpretation  of  doubtful  phenomena.  The  conclusion  drawn  from  the  entire  study  was 
that  the  parental  strain  and  the  aberrant  derivative  differed  in  respect  only  to  the 
one  character  of  sclerotial  pigmentation.  The  phenomenon  would  thus  appear  to 
form  an  exact  parallel  with  mutations  by  loss  of  single  genes  in  the  higher 
organisms,  and  it  would  stand  in  contrast  with  the  fungal  mutations  described  by 
Arcichovskij  (2),  Schiemann  (69),  and  Waterman  (83),  in  which  there  was  simul- 
taneous variation  in  several  independent  respects. 

The  explanation  of  an  apparent  genoty])ic  alteration  on  the  basis  of  such  a  profound 
change  as  the  sheer  "  dropping  out  "  of  an  element  from  a  most  delicately  balanced 
reaction-system  would  appear  out  of  the  question.  I  can  only  visualise  the  organism 
as  a  unity  in  itself,  in  the  sense  that  it  consists  of  a  very  great  number  of  elements, 
each  an  elaborate  reaction-system,  which  bear  a  specific  and  causal  relationship  to 
one  another.  The  continued  harmonious  functioning  of  the  total  system  minus  one 
of  its  parts  is  to  me  unthinkable. 

Pure  lines  of  organisms  are  often  separated  by  characters  which  are  extremely 
minute  and  difficult  to  detect,  as  for  example  the  obscure  serological  reactions  which 
differentiate  certain  races  of  bacteria,  and  it  would  appear  probable  that  were  the 
analysis  in  the  present  case  carried  to  a  further  degree  of  refinement,  many  deeply 
underlying  and  elusive  physiological  differences  between  the  two  strains  would  be 
detected.  Such  work,  however,  would  constitute  a  most  laborious  and  intensive 
study  quite  beyond  the  scope  of  the  present  paper. 

5.  Origin  of  Colourless  Form.     Special. 

A  question  of  some  considerable  importance  is  the  exact  point  of  origin  of  the 
colourless  strain.  As  described  it  apparently  arose  in  the  single  colourless  sclerotium 
found  in  tube  (E.  G),  and  for  long  no  doubt  was  entertained  that  this  was  its  actual 
point  of  inception.  In  the  absence  of  further  evidence,  however,  there  was  no 
certainty  of  this,  for  the  colourless  strain  would  only  be  observed  on  the  production 
by  it  of  sclerotia,  and  if  sclerotia  were  not  formed  the  presence  of  this  strain  could 
not  even  be  suspected. 

Cultures  (D.  1-12)  had  been  made  from  the  pedigree  stock  (C.  1)  in  the  usual  way, 
by  transference  of  spores  on  a  platinum  wire,  and  a  large  and  indefinite  number  of 
these  reproductive  bodies  would  tlius  be  implanted  in  each  tube. 

Approximately  100  per  cent,  of  these  spores  would  germinate,  and  the  resulting 
growth  would  therefore  comprise  very  many  intermingled  mycelia,  some  or  all  of 
which  would  produce  conidiophores  and  spores.  Similarly,  the  six  tubes  (E.  1-6)  of 
August  13,  prepared  in  the  same  way  from  one  (D.  6)  of  the  previous  twelve,  would 
each  contain  an  indefinite  number  of  mycelia,  each  mycelium  corresponding  to  a 
single  individual.  Each  tube  might,  therefore,  contain  one  hundred  or  more 
individuals  of  identical  genotypic  constitution,  gi'owing  inextricably  mixed  together 
and  all  or  few  producing  spores  and  later  sclerotia. 
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Furthermore,  the  first  crop  of  spores  produced  from  the  myceha  in  the  tube  may 
fall  to  the  surface  of  the  medium,  immediately  germinate,  and  two  or  three  days 
later  again  reproduce,  when  the  same  process  may  be  repeated.  Thus  in  each  test- 
tube  generation  there  may  be  two  or  more  lineal  generations,  depending  partly  upon 
the  conditions  to  which  the  growth  is  subjected,  and  partly  upon  the  available 
germinating  surface  of  the  medium. 

From  the  pedigree  stock  (C.  1)  to  the  culture  of  August  13  (E.  6)  containing  the 
first  colourless  sclerotium  there  were  present  three  direct  test-tube  generations, 
which  may  actually  consist  of  from  about  six  to  twenty  hneal  generations,  and  at 
some  one  point  in  this  series  of  life  cycles  the  aberrant  strain  may  have  arisen.  In 
the  series  of  cultures  under  consideration  there  was  no  evidence  that  any  accessory 
spore  form  was  produced,  nor  were  the  conditions  such  as  would  be  conducive  to  their 
formation.  A  brief  discussion  of  the  two  possible  points  of  origin,  the  spore  and  the 
sclerotium,  is  perhaps  desirable. 

C.  Spore  Formation. 

It  will  be  clear  that  the  colourless  strain  may  have  arisen  as  a  variation  in  a  single 
spore  at  any  point  between  the  second  lineal  generation  in  tube  (C.  1)  and  the  last 
lineal  generation  in  tube  (E.  6).  If  for  example,  in  the  former,  none  of  the  offspring 
of  this  spore,  although  present  in  the  later  cultures,  produced  sclerotia  until  a 
favourable  opportvmity  occurred  in  tube  (E.  6).  As,  for  essential  purposes,  the 
colourless  form  differs  from  the  parental  form  only  in  the  absence  of  sclerotial 
pigmentation,  the  presence  of  the  former  could  not  possibly  be  ascertained  in  a 
cultui'e  of  the  latter  in  the  absence  of  the  formation  of  its  sclerotia.  If,  therefore, 
the  aberrant  form  arose  as  a  single  spore  variation,  it  is  impossible  to  locate  exactly 
its  point  of  origin. 

(Jertain  facts  relevant  to  the  foregoing  must,  however,  be  noted.  It  has  been  an 
invariable  technique  throughout  the  whole  of  the  research,  when  transferring 
inoculum  from  one  tube  to  another,  to  take  this  either  from  a  single  conidiophore  or, 
preferably,  from  a  single  germinated  sclerotium.  Now,  every  conidiophore  and  each 
sclerotium  arises  not  oidy  from  a  single  original  s]Jore,  but  actually  takes  oi'igin  in  a 
single  mycelial  cell.  It  will  be  obvious  that  this  very  considerably  narrows  the  range 
wherein  the  aberrant  form  may  have  arisen  as  a  spore  saltation. 

Furtliermore,  from  its  inception  the  colourless  strain  lias  been  constantly  repro- 
duced by  means  of  spores,  and  if  it  arose  as  a  spore  variation,  the  very  existence  of 
the  new  form  implies  that  this  particular  spore  gemninated  and  produced  a  mycelium 
giving  rise  to  conidiophores  and  spores,  the  latter  possessing  the  new  potentiality. 
But  a  single  mycelium  gives  rise  to  an  immense  number  of  conidiophores,  and  the 
number  of  spores  produced  on  even  a  single  conidiophore  is  very  great.  It  is,  therefore, 
almost  unthinkable  that  the  new  character  should  not  again  have  appeared  in  any  of 
the  direct  and  lateral  cultures  made  from  tube  ( E.  G)  and  from  the  previous  generations. 
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For  purposes  other  than  this  particular  piece  of  investigation  it  was  decided  to  test 
the  probability  of  the  "throwing  up"  of  the  colourless  strain  in  a  parental  culture, 
when  only  a  single  spore  of  the  former  is  present.  Accordingly,  inoculum  from 
(E.  4)  was  transferred  to  a  clean  tube-slant  and  streaked  over  the  surface,  after  which 
there  was  inserted  a  single  spore  of  the  colourless  strain.  The  circumstances  would 
thus  correspond  in  all  essentials  to  those  in  tube  (E.  6)  if  the  aberrant  form  had 
arisen  as  a  variation  in  a  single  spore.  The  results  arranged  diagrammatically  are 
shown  in  Table  II. 

Tube  (I)  was  prepared  on  March  2  and  ten  days  later,  when  the  sclerotia  were 
mature,  it  was  found  that  of  the  23  sclerotia  in  the  culture  2  were  colourless,  the 
aberrant  strain  thus  "  throwing  up "  in  the  first  tube  generation.  On  March  9 
cultures  (II.  1-6)  were  taken  from  (I)  and  ten  days  later  it  was  found  that  2  of  these 
contained  sclerotia  of  both  strains,  the  remaining  4  containing  only  black  sclerotia. 
In  (II.  1)  there  were  15  black  and  1  colourless  sclerotium,  and  in  (II.  2)  27  black  and 
1  colourless  sclerotium.  From  eacli  of  the  tubes  containing  only  black  sclei'otia 
(II.  3-6)  six  cultures  were  made  (III.  1-24).  Of  these  24  cultures  9  contained  both 
parent  and  colourless  form  and  the  remainder  black  sclerotia  only.  In  the  former 
there  were  162  black  and  17  colourless  sclerotia.  From  each  of  the  15  tubes 
containing  only  black  sclerotia  6  cultures  (IV.  1-90)  were  prepared.  Of  these, 
24  tubes  contained  both  black  and  colourless  sclerotia  in  the  proportion  of  528  of  the 
former  to  39  of  the  latter,  and  the  remaining  66  tubes  contained  only  black  sclerotia. 
In  the  6  tubes  (IV.  79-84),  derived  from  (III.  17),  only  black  sclerotia  were  present, 
and  it  appeared  that  in  this  line  the  colourless  strain  had  "  run  out."  To  test  if  this 
were  so,  6  cultures  (V.  1-36)  were  made  from  each  of  these  tubes.  In  at  least  1  tube 
in  each  set  of  6  the  colourless  strain  was  again  thrown  up,  there  being  in  all  10  tubes 
containing  both  kinds  of  sclerotia  and  26  tubes  containing  only  the  black  form.  The 
work  was  not  carried  further  tlian  this  ;  but  in  the  total  experiment,  which  repre- 
sented 5  tube  generations  and  157  cultures,  the  aberrant  form  appeared  in  46  tubes. 
A  parallel  control  series  prepared  with  similar  inoculum  from  (E.  4)  was  synchronised 
with  the  foregoing,  similar  in  all  respects  save  that  the  single  spore  of  the  colourless 
strain  was  omitted.     All  tlie  sclerotia  were  black. 

It  will  be  clear  from  this  experiment  that  the  presence  of  a  single  spore  of  the 
aberrant  strain  in  a  parental  tube  may  cause  the  new  chai'acter  to  appear  in  the  first 
generation,  and  in  a  large  proportion  of  cultures  of  succeeding  generations.  As 
already  stated,  the  generations  (C.  1),  (D.  1-12),  and  (E.  1-6)  were  all  subcultured 
freely,  but  the  aberrant  strain  was  absent  from  all  descendants.  In  liearly  200  direct 
tube  generations  of  the  parent  strain  (F.  la)  the  aberrant  form  has  not  appeared. 
This  can  only  be  explained  by  the  hypothesis  that  the  colourless  strain  did  not  arise 
as  a  spore  variation,  but  arose  in  and  was  confined  to  the  initial  colourless  sclerotium, 
and  that  the  removal  of  this  eliminated  the  aberrant  strain  from  the  parental  culture 
(E.  6).      The  new  strain  was  thus  seen  immediately  on  its  inception  and  isolated. 
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D.  Sclerotial  Formation. 

It  will  be  evident  from  text-fig.  1  that  the  sclerotium  consists  of  a  mass  of  more  or 
less  firm  tissue,  enclosed  in  a  thin  layer  of  cells,  in  the  walls  of  which  the  black 
pigment  is  located.  Now,  unless  the  sclerotium  arises  in  a  single  hyphal  cell,  in 
which  the  variation  may  have  occurred,  the  physiological  change  resulting  in  loss  of 
colour  must  have  taken  place  simultaneously  in  all  those  peripheral  cells  which 
normally  would  form  pigment.  In  view  of  the  fact  that  the  new  character  is  strictly 
hereditary,  and  obtains  by  the  growth  of  any  portion  of  a  single  individual,  and  that 
the  initial  colourless  sclerotium  arose  in  a  tube  culture  containing  a  perfectly  normal 
growth,  such  a  deep-seated  physiological  change  occurring  simultaneously  in  a  very 
great  number  of  cells,  and  to  an  identical  degree  in  all,  is  a  process  extremely  difficult 
to  conceive. 

Very  little  is  known  of  the  initial  developmental  stages  in  the  foi-mation  of 
sclerotia,  but  the  latter  are  usually  considered  to  be  a  further  growth  or  modification 
of  haptera,  structures  which  are  very  accurately  delineated  in  the  works  of 
Marshall  Ward  (80)  and  Istvanffi  (33).  In  their  formation  one,  or  more  usually 
several  related  hyjihte  form  short  branches  on  which  arise  tufts  of  secondary  branches. 
These  are  directed  vertically  to  the  contact  surface,  and  the  clustered  hyphte  mat 
together  laterally  forming  a  solid  body  of  a  tassel-like  shape  and  structure. 

It  is  of  interest  to  note  that  DE  Bary(5,  6)  regarded  sclerotia  and  haptera  as 
discrete  bodies,  stating  that  although  clusters  of  haptera  "  have  been  mistaken  for 
sclerotia  .  .  .  they  have  no  connection."  Munn  (56),  who  has  most  recently 
investigated  this  fungus,  writes  :  "  It  was  observed  that  in  many  cases  the  appres- 
soria  are  incipient  stages  of  the  sclerotia  (Plate  3,  figs.  23,  24)."  Now  if  Munn's 
figs.  23,  24,  and  13,  which  represent  stages  in  the  development  of  sclerotia,  be 
compared  with  his  figs.  25  and  27,  which  show  the  development  of  haptera,  the 
essential  difference  is  plain  ;  and  this  distinction  is  obvious  in  the  many  delineations 
of  these  structures  by  previous  investigators.  There  are  no  transition  -stages.  The 
general  laboratory  study  of  the  fungus  and  the  relation  of  haptera!  and  sclerotial 
formation  to  various  media  and  known  conditions  show  that  the  structures  are  organs 
sui  generis.  This  was  confirmed  in  a  very  striking  way  by  the  development  of  these 
bodies  in  the  aberrant  strain.  The  haptera  of  the  normal  strain  very  rapidly  become 
dark  coloured  and  finally  almost  black,  and  this  similarity  of  colouring  with  the 
sclerotia  is  in  large  part  responsible  for  the  confusion.  In  the  case  of  the  colourless 
strain,  however,  it  is  most  helpful,  for  whilst  the  sclerotia  have  completely  lost  their 
pigment,  the  haptera  have  retained  it,  and  thus  in  the  one  culture  there  co-exist  the 
albinistic  sclerotia  and  brown  or  olive-black  haptera.  This  colour  difference  is 
distinctive  from  the  inception  of  the  two  structures. 

If  therefore  the  sclerotia  and  haptera  are  discrete  entities,  the  earliest  stages  that 
we  know  in  the  development  of  the  former  are  the  clusters  of  interweaving  hyphae 
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depicted  by  Istvanffi,  Munn,  and  others,  and  there  is  no  reason  why  these  should 
not  have  arisen  in  the  branching  of  a  single  mother-cell.  On  searching  selected 
cultures  both  by  sectioning,  and  more  successfully  by  mechanical  teasing  and  inh-a 
vitam  staining  with  neutral-red  and  methylene-blue,  the  initial  stages  were  soon 
found.  For  the  present  issue,  the  main  point  of  interest  is  that  each  sclerotium  arises 
symphogenetically  from  an  individual  cell  of  the  mycelium.* 

On  the  open  surface  of  a  soft  matrix  the  sclerotia  usually  remain  separate,  but  if 
developed  in  a  fissure  or  along  a  line  of  contact  they  often  fuse  to  form  a  continuous 
crust.  Figs.  50  and  51  on  Plate  15  of  Istvanffi's  memoir  (33)  are  to  be  interpreted 
in  this  latter  way.  If  mixed  inocula  of  parental  and  colourless  strains  be  grown, 
interesting  composite  sclerotial  crusts  may  very  rarely  be  obtained.  In  these  there 
is  no  intimate  mingling  of  hyphae,  but  merely  a  lateral  "  grafting,"  and  if  such  a 
composite  sclerotium  be  germinated,  the  coiiidia  and  conidiophores,  although  indis- 
tinguishable, each  reproduce  purely  their  original  parental  type. 

From  the  foregoing  discussion  it  will  be  clear  that  the  physiological  change  which 
resulted  in  the  production  of  the  aberrant  form  must  have  occm-red  in  the  mother- 
cell  which  gave  rise  to  the  initial  colourless  sclerotium  in  tube  (E.  6). 

G.  Nature  of  Loss  of  Colour. 

A  detailed  biochemical  study  of  sclerotial  pigmentation  is  beyond  the  scope  of  the 
present  paper,  but  one  may  perhaps  refer  to  certain  simple  experiments  which  throw 
some  light  upon  the  nature  of  the  physiological  change  which  has  occasioned  the 
j^roduction  of  the  colourless  form. 

Duiing  the  last  two  decades  the  pigmentation  processes  in  animals  and  plants  have 
been  the  subject  of  much  investigation,  and  although  tliis  has  followed  the  two 
distinct  lines  of  biochemistry  and  genetics,  the  results  have  complemented  each  other 
in  a  quite  remarkable  way.  The  general  trend  of  evidence  would  appear  to  lead  to 
the  view  that  the  pigmentation  processes  throughout  are  essentially  of  an  oxidatory 
nature  with  associated  subsidiary  or  perhaps  preliminary  reduction  processes. 

Albinos  have  been  shown  to  be  divisible  into  two  categories  termed  "  dominant 
whites  "  and  "  recessive  whites  "  respectively.  In  the  latter,  the  absence  of  pigment 
is  due  to  the  lack  of  either  the  oxidase  factor,  or  the  chromogen  factor,  or  of  both  ; 
wiiilst  in  the  former,  both  oxidase  and  chromogen  are  present,  but  their  interaction  is 
prevented  by  a  third  inhibiting  factor.  The  oxidatory  nature  of  the  pigmentation 
processes  is  common  to  so  many  widely  diverse  organisms,  that  one  might  with  some 
justification  expect  to  find  it  even  in  the  fungi,  and  consequently  to  find  also  that 
albinism  in  such  an  organism  as  Botrytis  cinerea  showed  common  features  with  the 
same  ])henomenon  in  rabbits  or  primulas.  This  fungus,  however,  is  asexual,  and  the 
terms  "dominant"  and  "recessive"  are  inapplicable;  but  if  the  pigmentation 
*  In  certain  strains  of  Botrytis  cinerea  the  sclerotia  originate  meristogenetically. 
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processes  are  parallel  to  those  of  other  organisms,  the  physiological  changes  which 
have  resulted  in  the  production  of  the  colourless  form  may  involve  either  the 
presence  of  a  new  inhibiting  factor  or  the  loss  of  either  oxidase  or  chromogen  or  both. 

The  simplest  and  most  direct  method  of  examining  the  condition  of  the  aberrant 
form  is  to  test  the  colourless  sclerotia  for  the  presence  of  oxidase.  If  this  be  present 
and  active  it  follows  that  the  chromogen  factor  must  be  absent  for  otherwise  pig- 
mentation would  have  ensued.  If  there  is  no  oxidase  reaction,  then  either  this  factor 
is  lacking  or  its  activity  inhibited  by  a  third  factor.  The  technique  was  based  on 
that  devised  by  Keeble  and  Armstrong  (41,  42,  43)  in  their  investigations  of 
flower  pigments.  Solutions  of  benzidine,  a-naphthol,  and  ^)-phenylenediamine  gave 
a  marked  reaction,  whilst  with  a  solution  of  guaiacum-resin  a  barely  perceptible 
lavender- blue  colour  was  obtained. 

These  reactions  were  unaltered  by  preliminary  treatment  of  the  sclerotia  with 
absolute  alcohol  or  hydrocyanic  acid,  or  by  the  subsequent  addition  of  hydrogen- 
peroxide,  and  in  all  cases  they  were  produced  in  a  few  minutes  irrespective  of 
temperature  differences  ranging  from  that  of  the  laboratory  to  incubation  at  37°  C. 

The  distribution  of  the  oxidase  in  the  colourless  sclei'otia  was  identical  in  every 
case,  and  was  noteworthy,  being  confined  to  the  walls  of  the  peripheral  cells  which  in 
the  parental  form  contain  the  normal  black  pigment.  So  exact  was  this  agreement 
that  some  of  the  sections  of  colourless  sclerotia  treated  with  benzidine  were  barely 
distinguishable  from  sections  of  normal  sclerotia. 

Whether  the  colourless  sclerotia  have  been  formed  in  light  or  darkness  makes  no 
difference  to  the  intensity  of  the  oxidase  reaction,  nor  is  this  changed  by  growth  of 
the  fungus  at  various  temperatures  or  upon  nutrient  substrata  presenting  very 
different  physiological  conditions. 

On  Wheldale's  view  (85)  the  chromogens  are  flavones  or  flavoual  glucosides.  The 
presence  of  these  bodies  in  tissues  may  readily  be  detected  by  the  bright  colour  given 
on  exposure  to  ammonia  vapour,  and  the  dull-green  (or  sometimes  red-brown)  colour 
with  ferric-chloride.  When  these  tests  were  applied  to  sections  and  pulps  of  the 
colourless  sclerotia  a  totally  negative  result  was  obtained. 

Neilson  Jones  (39)  has  shown  that  even  in  certain  recessive  whites  both  cliromogen 
and  oxidase  appear  to  be  present,  but  that  owing  perhaps  to  their  discrete  distribution 
interaction  is  prevented.  To  test  the  possibility  of  this  explanation  of  the  albinism 
of  the  aberrant  strain  a  number  of  sclerotia  were  boiled  in  5  c.c.  of  50  per  cent,  alcohol 
to  destroy  the  oxidase,  pulped,  filtered  and  the  filtrate  evaporated  to  a  few  drops. 
Sections  of  the  colourless  sclerotia  were  then  immersed  in  this  and  incubated  at  27°  C. 
No  trace  of  oxidase  reaction  was  observed,  confirming  the  absence  of  chromogen  in  the 
sclerotia. 

True  brown  and  black  pigments  are  of  the  very  greatest  rarity  in  the  higher  plants. 
MoBius  (52)  however  has  shown  that  they  do  exist  as  in  the  spots  on  the  alee  of 
Vicia  Faba,  but  even  here  the  colouring  matter  is  dissolved  in  the  cell-sap. 
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In  thin  sections  of  the  norma,!  sclerotia  of  Botrytis  cinerea  the  iiigment  appears  as 
a  true  brown-black.  Furthermore,  as  shown  in  text-fig.  1,  it  is  contamed  m  the 
substance  of  the  cell-walls  and  not  in  the  cell-content  which  is  perfectly  hyaline.  It 
thus  differs  very  markedly  from  the  usual  plant  pigments,  and  would  appear  to  be 
more  closely  related  to  the  melanotic  pigments,  certain  of  the  melano-proteins 
existing  dissolved  in  the  keratin  structures. 

The  mother  substance  of  melanin  is  generally  supposed  to  be  tyrosin,  this 
colourless  chromogen  being  converted  into  a  black  pigment  by  the  oxidising  enzyme 
tyrosinase.  If,  therefore,  the  black  pigment  in  the  normal  sclerotia  be  melanin,  the 
aberrant  sclerotia  which  presumably  lack  the  factor  for  tyrosin  should  exhibit  a 
tyi-osinase  reaction.  The  tests  for  oxidase  already  referred  to  are  only  for  the 
detection  of  phenolase  or  laccase  which  Bourquelot  and  Bertrand  (14),  Zellner  (88), 
Pringsheim  (63),  Kastle  (40)  and  others  have  shown  to  be  almost  universally  present 
in  fungi. 

Tyrosinase  is  comparatively  rare  in  its  distribution  in  fungi.  Bertrand  (11), 
however,  has  shown  its  presence  in  species  of  Russula,  especially  Russula  nigricans, 
and  LuTZ  (49)  in  the  stipes  and  pilei  of  Gyromitra  gigas  and  Dlsciotis  parlata  ;  while 
more  recently  it  has  been  demonstrated  in  Lenzites  scejnaria  by  Zeller  (87)  and  in 
various  species  of  Actinomyces  by  Drechsler  (24). 

According  to  Bourquelot  and  Bertrand  (14)  the  best  substrate  for  tyrosinase  is 
tyrosin.  Colourless  sclerotia  incubated  at  18°  C,  25°  C,  and  37°  C.  for  14  days  in  a 
saturated  aqueous  solution  of  silk-tyrosin  gave  a  totally  negative  reaction,  nor  could 
this  be  changed  by  preliminary  treatment  with  alcohol,  ferrous  sulphate  or  hydro- 
cyanic acid  or  by  subsequent  treatment  with  hydrogen  peroxide.  A  pulp  of  sclerotia, 
or  sections,  gave  a  similar  result  with  this  treatment. 

Chodat  and  Staub  (20)  have  advocated  the  use  of  p-cresol  with  the  addition  of 
glycin  for  the  detection  of  the  enzyme  tyrosinase.  Sclerotia  immersed  in  this  and 
incubated  at  37°  C.  showed  after  many  days  a  diffuse  pale-yellow  tint  instead  of  the 
rich  violet-blue  reaction  characteristic  of  tyrosinase. 

In  1907  AbderhaI;DEN  and  Guggenheim  (1)  showed  that  N/100  hydrochloric  acid 
permanently  destroyed  the  activity  of  tyrosinase,  but  in  the  present  case  an 
immediate  and  direct  oxidase  reaction  with  benzidine  was  obtained  after  preliminary 
treatment  of  the  colourless  sclerotium  with  N/40  hydrochloric  acid,  although  a  strength 
of  N/30  destroyed  its  activity. 

It  was  found  by  Bertrand  (11)  that  tyrosinase  could  be  differentiated  from  other 
oxidising  enzymes  by  its  lower  lethal  temperature,  its  activity  being  inhibited  at 
50°  C.  The  oxidase  in  the  colourless  sclerotia  is  inhibited  only  by  a  temperature  of 
from  80°  C.  to  85°  C. 

Furthermore  Bach  (3,  4)  has  shown  that  the  pigment  formed  by  the  action  of 
tyrosinase  upon  tyrosin  may  be  oxidised  further  to  a  colourless  compound  by  a  dilute 
acid  solution   of  potassium  permanganate.     The  normal  black  sclerotia  of  Botrytis 
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cinerea  remained  unaltered  when  treated  in  this  way.  The  black  pigment  is  indeed 
extraordinarily  stable,  remaining  apparently  unchanged  after  boiling  in  concentrated 
acids  and  alkalies.  When  treated  thus  the  stuff  of  the  tissues  disappears  whilst  the 
pigment  remains  as  a  most  delicate  gossamer-like  cellular  film.  This  stability 
immediately  distinguishes  the  sclerotial  pigment  of  Botrytis  from  other  black  fungal 
pigments  such  as  the  Aspergillin  of  Aspergillus  niger  which  is  sohible  in  hot  water,  or 
the  colouring  matter  of  Daldinia,  Ilypoxylon  and  various  other  carbonaceous 
Ascomycetes,  which  is  soluble  in  alcohol. 

The  foregoing  simple  experiments  only  show  (a)  that  the  colourless  sclerotia  possess 
a  direct  oxidase  system  of  the  laccase  type  in  a  region  corresponding  exactly  with  that 
in  which  the  black  pigment  is  present  in  the  sclerotia  of  the  parental  strain  ;  and  (h) 
that  flavones  or  chromogens  are  lacking  in  the  ci^lourless  sclerotia,  or  if  present,  they 
are  inactivated  in  some  way  not  understood.  The  experiments  do  not  prove  that 
the  pigment  in  the  sclerotia  of  the  [larental  strain  is  due  to  the  oxidation  of  a 
chromogen  or  that  the  aberrant  strain  has  arisen  by  the  loss  or  inactivation  of  a  factor 
for  chromogen.  When,  however,  the  above  results  are  considered  along  with  those 
of  GoRTNER  (32),  Keeble  and  Armstrong  (41,  42,  48)  and  others,  which  cumulatively 
form  such  positive  circumstantial  evidence,  it  is  somewhat  difBcult  to  resist  drawing 
the  deduction  that  the  albino  strain  of  Botrytis  cinei'ea  has  indeed  arisen  from  the 
black  piirent;  J  form  by  the  loss  or  modification  of  a  factor  for  chromogen. 

7.  Significance  of  Colourless  Form. 

A.  Discussion  of  Recorded  Mutations. 

The  essential  facts  in  the  preceding  pages  may  be  sununarised  as  follows  :  The 
colourless  form  arose  spontaneously  witliout  any  evident  relation  to  external  con- 
ditions or  stimuli,  in  a  single  sclerotium  of  a  culture  which  on  accepted  criteria  was  a 
pure  line.  The  new  form  is  apparently  differentiated  from  its  parent  in  respect  to  a 
single  character  only,  that  of  colour,  and  this  albinism  would  appear  to  be  associated 
with  the  absence  of  a  factor  for  chromogen.  The  change  has  occiu'red  once  only, 
and  has  given  rise  to  a  form  unknown  in  nature  and  perfectly  constant  under  all 
conditions.  It  would  seem  possible  to  place  only  one  interpretation  on  these  facts, 
that  of  mutation,  and  accordingly  when  I  was  privileged  to  exhibit  this  form  to  the 
Linnean  Society  of  London  on  April  3,  1919,  I  described  it  as  "an  albino  mutant  of 
Botrytis  cinerea." 

Previous  records  of  mutation  in  the  fungi  fall  into  two  groups.  In  the  first  are 
those  changes  in  genetic  constitution  which  appear  to  bear  a  direct  and  purposive 
relation  to  certain  conditions  which  have  operated  during  the  development  of  the 
organism  or  that  of  the  preceding  generation.  As  an  illustration  may  be  instanced 
the  permanent  acquirement  of  the  power  to  ferment  lactose  by  an  organism  pre- 
viously unable  to  foi-m  lactase,  this  change  being  brought  about  by  the  growth  of 
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the  organism  or  its  progenitor  upon  a  lactose  medium.  Tlie  second  category  of 
mutations  includes  those  cases  in  which  the  genetic  change  is  apparently  quite 
fortuitous,  and  this  group  is  divisible  into  two  sub-groups.  In  the  first  of  these  the 
change  is  associated  with  certain  conditions  which  interfere  with  the  normal  metabolic 
reactions  of  the  organism.  For  example,  by  treating  the  organism  with  toxic 
substances,  or  by  subjecting  it  to  extremes  of  temperature  or  desiccation,  various 
indeterminate  morphological  or  physiological  changes  are  induced,  such  as  alterations 
in  spore  dimensions  or  coloration,  or  growth  upon  standardised  media.  In  the 
second  sub-group  the  changes  occur  spontaneously  under,  so  far  as  may  be  judged, 
perfectly  normal  conditions  of  development,  and  this  category  would  include  the 
colourless  form  of  Botrytis  cinerea. 

Of  the  purposive  mutations  the  classical  case  perhaps  is  that  described  by 
Massee(51),  in  which  Tricothecium  roseum  was  changed  into  a  virulent  parasite 
by  habituating  it  to  a  previously  immune  host.  This  research  to  which  credence  is 
still  liirgely  given  was,  even  for  the  year  1905,  so  inexact  in  all  critical  details  that 
it  may  be  now  be  regarded  as  of  merely  historical  interest.  Ptesults  worthy  of 
serious  consideration  are  described  in  the  researches  of  Salmon  (66,  67)  on  the 
adaptive  parasitism  of  Krysiplie  Gramitds  by  means  of  growth  upon  bridging  hosts, 
or  the  parallel  results  obtained  with  various  rust-fungi  by  Marshall  Ward  (81,  82), 
Freeman  (28,  29),  Johnson  (38),  Pole  Evans  (62),  and  others.  In  none  of  this 
work,  however,  were  the  experiments  carried  out  with  single-spore  strains  of  the 
fungi,  under  the  rigidly  controlled  conditions  imperative  in  such  work.  Nor  was  it 
realised  until  the  more  recent  researches  of  Mains  (50),  Brooks  and  Cooley(15), 
Lauritzen  (46),  and  others,  how  vital  are  the  eftects  of  even  minute  changes  in  the 
light,  temperature,  age,  or  humidity  relations  of  the  host  plants.  The  findings  of 
KiDD  and  West  (44)  on  the  importance  of  physiological  predetermination  in  the  life 
of  the  plant  may  also  have  a  very  material  bearing  on  questions  of  relative  suscepti- 
bility or  immunity  of  host  plants  to  fungal  attack,  and  indicate  factors  which  will 
be  extremely  difficult  to  control.  Apart,  however,  from  such  collateral  evidence  as 
this,  the  value  of  bridging  species  in  changing  the  physiological  characters  of  the 
fungus  is  not  supported  by  the  work  of  BiFFEN  (12),  and  has  been  directly  con- 
travened by  the  researches  of  Reed  (64),  and  of  Stakman  and  his  associates  (71-74). 
After  much  intensive  study  of  this  subject,  the  latter  conclude  "...  the  writers 
have  not  been  able  to  detect  any  mutation  nor  to  induce  perceptible  evolutionary 
changes  experimentally."  With  rare  exce]itions  such  conclusions  are  supported  by 
the  invariable  experience  of  all  the  critical  and  accurate  workers  in  more  recent 
experimental  mycology,  and  there  would  appear  to  be  little  doubt  that  when  a 
pure  line  organism  is  examined  by  a  careful  investigator  under  rigidly  controlled 
conditions,  heritable  alterations  bearing  a  purposive  relation  to  educative  treatment 
are  absent. 

The  lecorded  instances  of  non-purposive   genetic  changes  in  organisms  induced  by 
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exposure  to  unusual  conditions  are  very  few  in  number.  In  1908  Arcichovsku  (2) 
described  a  form  of  Aspergillus  niger  witli  yellow-brown  spores  which  arose  in  a 
culture  of  the  black  form  growing  in  Raulin's  fluid  to  which  0*000 1  per  cent,  of  zinc 
sulphate  had  been  added.  Four  years  later  Schiemann  (69)  obtained  various  mutants 
of  Aspergillus,  niger  by  treatment  of  cultures  with  potassium  bichromate,  exposure 
to  high  temperatui'es,  and  so  forth. 

In  the  same  year  Waterman  (83)  caused  Aspergillus  niger  to  mutate  by  treatment 
with  2  per  cent,  galactose,  rhamnose,  or  glucose,  1  per  cent,  boric  acid,  ^-oxybenzoic 
acid,  or  dichloracrylic  acid.  A  fungus  which  the  author  speaks  of  as  Penieillium 
glaucum  was  also  caused  to  throw  mutants  l^y  treatment  with  any  of  the  following 
substances  : — -^-oxybenzoic  acid,  salicylic  acid,  trichloracrylic  acid,  tetrachlor- 
propionamid,  pentachlor-propionamid,  and  pyrocatechetic  acid  (pvrocatechuic  ?). 

Tn  all  the  above  cases  the  new  forms  described  differed  from  the  parental  fungi 
in  many  morphological  and  physiological  characters,  the  most  obvious  being  that  of 
the  colour  of  the  spores.  Furthermore,  the  production  of  the  mutant  forms  was  not 
a  constant  and  specific  reaction  to  certain  definite  chemical  stimuli,  but  was  merely 
a  fortuitous  circumstance  resulting  from  the  generally  unfavourable  conditions  to 
which  the  organisms  had  been  subjected.  These  were  of  such  diverse  nature  that 
one  might  expect  that  almost  any  substance  deleterious  to  growth  or  interfering 
with  the  metabolic  processes  in  any  way  would  produce  similar  results.  Also  not 
every  culture  subjected  to  these  conditions  gave  uniform  results  over  the  entire 
growth,  but  only  occasional  cultures,  and  in  these  cultures  only  sporadic  individuals, 
conidiophores  or  spores.  Moreover,  as  Schiemann  (69)  points  out,  mutations  were 
also  produced  when  the  fungi  were  developing  under  favourable  conditions. 

Such  results  as  these  would  have  a  somewhat  indifferent  reception  if  obtained  in  a 
physical  or  chemical  laboratory.  Their  anomalous  natiu'e  is  stated  clearly  in 
Schiemann's  paper  as  follows  :  "  Die  konstanten  Farbmutatiouen  sind  beide  auf  mit 
KoCr.iO^  versetzten  Nahrboden  aufgetreten.  Von  einer  spezifischen  Wirkung  des 
K.jCroOy  zu  sprechen,  ist  trotzdem  nicht  moglich.  Schon  der  Umstand,  dass  der 
Farbumschlag  unter  liber  100  Kulturen  mit  ^^Qvcfdr.  nur  zweimal  stattfand,  hier 
wiederum  nicht  die  ganze  Decke  betraf,  sondern  unter  vielen  hunderten  von  Kopfchen 
nur  einzelne,  spricht  dagegen.  Ferner  hat  die  Pro^e^/s-Mutante  dieselbe  Richtung 
eingeschlagen  ;  diese  aber  ist  aus  Hitzekulturen,  einmal  mit,  einmal  ohne  Zusatz  von 
K.iCr.iO,^  hervorgegangen.  Endlich  sind  nach  braun  abiindernde  Kulturen  des 
Aspergillus  niger  auch  sonst  gelegentlich  beobachtet  worden — -wenn  auch  selten — ohne 
dass  das  Chroma!  zur  Verwendung  kam." 

In  a  discussion  of  the  causes  of  genetic  variation,  Bateson(7)  has  written  :  "The 
state  of  knowledge  of  this  whole  subject  is  .  .  .  most  unsatisfactory,  chiefly  for  the 
reason  that  in  none  of  the  cases  which  are  alleged  to  show  a  positive  result  have  two 
observers  been  over  the  same  ground,  or  as  yet  confirmed  each  other  ...  I  do  not 
know  a  single  case  which  has  been  established  and  confirmed  in  such  a  way  that  we 
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could  with  conKdence  expect  to  witness  the  alleged  phenomena  if  we  were  to  repeat 
the  experiment." 

In  the  work  of  Arcichoyskij  (2),  Schiemann  (69),  and  Waterman  (83),  mutant 
forms  were  produced  so  easily  and  with  such  a  diversity  of  stimuli  and  simplicity  of 
technique,  that  it  seemed  probable  that  a  repetition  of  their  experiments  might 
produce  the  necessary  confirmatory  evidence.  Accordingly  single-spore  cultures  of 
two  strains  of  Aspergillus  niger  and  of  two  strains  of  Fenicillium  Italicum  were 
prepared.  (Waterman  used  "  Penicillium  glaucum,"  but  as  Thom  (77)  points  out, 
this  "  name  as  used  at  present  seems  to  be  applied  collectively  to  the  common  green 
forms,  which  under  examination  are  (piickly  found  to  be  not  one  but  several  species.") 
With  these  fungi,  the  experiments  of  Schiemann  (69)  and  Arcichovskij  (2),  and 
certain  of  those  of  Waterman  (83)  (galactose,  glucose,  rhamnose,  boric  acid,  and 
salicylic  acid)  were  carefully  repeated.  Although  modifications  in  the  colour  of  the 
spores,  general  morphological  facies,  and  physiological  reactions  were  observed,  these 
afi'ected  the  whole  growth  operated  upon  and  to  an  equal  degree  ;  but  the  repro- 
ductive bodies  of  the  modified  fungi,  when  returned  to  the  original  conditions,  gave  in 
every  case  the  original  result.  These  changes  were  phenotypic  and  not  genotypic, 
and  throughout  the  whole  series  of  experiments  not  a  single  mutation  occurred. 

More  recently,  experiments  have  been  carried  out  with  another  strain  of  Aspergillus 
niger  isolated  from  the  air,  and  with  a  strain  of  Penicillium  (roseum  f )  from  the  soil, 
both  these  having  been  in  culture  in  the  laboratory  for  several  months  and  repeatedly 
subcultured,  although  not  derived  from  single  spores.  The  fungi  were  subjected  to 
rather  gross  treatment  with  various  dilutions  of  toxic  substances,  such  as  silver 
nitrate,  copper  sulphate,  mercury  bichloride,  lead  nitrate,  potassium  acid  phosphate, 
sodium  chloride,  and  so  forth,  and  then  platings  were  made  from  these  tubes.  The 
resulting  growths  were  perfectly  normal. 

In  all  work  on  the  induction  of  genotypic  alterations  in  micro-organisms,  it  is 
extremely  difficult,  even  when  the  most  scrupulous  attention  is  given  to  every  detail, 
to  be  ,quite  certain  that  one  has  under  control  each  of  the  almost  infinite  number  of 
factors  which  may  influence  the  results.  Mycologists  are  only  now  beginning  to 
learn  how  extremely  complex  and  diflicult  to  isolate  such  factors  may  be,  and  tlie 
researches  of,  for  example,  Duggar(25)  and  his  collaborators  on  the  physiology  of 
certain  fungi  grown  in  culture  media,  or  those  of  CooNS  (21)  on  the  factors  involved 
in  the  growth  and  pycnidium  production  of  Plenodomus  fuscomaculans,  indicate  the 
complexities  underlying  apparently  the  most  simple  physiological  phenomena.  Not 
tlie  least  interesting  of  the  many  experimental  studies  throwing  light  on  the  bionomics 
of  Asjwrgillus  niger  are  those  of  Javillier  (34)  and  of  Steinberg  (75),  in  which  it  is 
shown  how  even  such  an  unsuspected  factor  as  the  different  quantities  of  zinc  present 
in  the  glass  of  flasks,  test-tubes,  and  other  apparatus  may  vitiate  comparative 
cultural  studies  of  this  fungus. 

If,   in    the    work   of  Arcichovskij  (2),    Schiemann  (69),    and    Waterman  (83), 
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accidental  contamination  did  not  occnr,  one  is  almost  forced  to  conclude  that  some 
accessory  factor  of  an  unrecognised  nature  must  have  been  operative  in  their  experi- 
ments ;  a  factor  absent  in  my  own  repetition  of  their  work  and  absent  from  the  studies 
of  every  other  mycologist  who  has  carried  out  critical  experimental  investigations  on 
these  fungi. 

Furthermore,  as  has  been  pointed  out,  these  investigators  emphasise  the  fact  that 
their  mutations  are  not  of  the  nature  of  specific  reactions  to  definite  chemical  or 
physical  stimuli,  but  only  the  results  of  a  generalised  interference  with  the  normal 
course  of  life  of  the  organism.  Thus  Schiemann  (69)  says:  "...  so  zeigt  sich  als 
ein  Gemeinsames,  dass  in  alien  Fallen,  wo  derartige  Farbiinderungen  auftrateu, 
Storungen  der  normalen  Lebens-verhiiltnisse  vorlagen.  Es  ist  deshalb  der  Schluss 
berechtigt,  dass  die  Mutation  in  den  beobachteten  Fallen  durch  einen  starken  Reiz 
ausgelost  wurde."  But  the  experience  of  other  investigators  demonstrates  the 
.  extreme  genetic  constancy  of  species  of  these  fungi  even  when  placed  for  many 
generations  under  the  most  unfavourable  conditions.  Numerous  workers  have  shown 
that  the  morphological  facies  and  physiological  activities  of  fungi  may  within  certain 
limits  be  changed  at  will  in  the  single  generation,  but  in  no  case,  with  the  above 
exceptions,  have  these  phenotypic  modifications  had  any  efl:ect  whatever  on  the 
genotypic  constitution  of  the  organisms. 

B.  Alternative  Interpretations. 

If  the  recorded  cases  of  mutation  in  the  fungi  need  not  be  accepted — and  for  my 
own  part  I  do  not  feel  compelled  to  accept  them- — how  then  is  one  to  regard  the 
colourless  strain  of  Botrytis  cinerea,  which  has  been  described  in  the  present  paper  ? 
Its  fortuitous  origin  as  a  single  sclerotium  in  a  perfectly  normal  single-spore  strain, 
whose  development  since  its  commencement  is  known,  has  already  been  emphasised. 
Under  no  conditions  whatever,  so  far  as  is  known,  may  the  black  sclerotial  pigment  of 
the  parent  strain,  or  the  albinism  of  its  derivative,  be  modified,  and  although  other 
characters  may  show  plasticity,  no  instance  of  a  permanent  alteration  has  been 
observed.  Does  the  phenotypic  constancy  of  the  parental  sti'ain  under  constant 
conditions  necessarily  imply  genotypic  purity  ? 

In  the  preceding  account  the  parental  strain  has  been  spoken  of  as  a  "  pure  line," 
and  on  aU  accepted  criteria  this  description  is  merited,  for  not  only  is  it  a  single  spore 
strain  of  an  asexual  organism,  but  it  has  been  reproduced  for  most  of  its  generations 
either  by  spores  from  single  conidiophores  or  from  single  germinated  sclerotia,  both  of 
which  take  origin  in  a  single  cell.  In  his  work  on  '  Evolution  by  means  of 
Hybridisation,'  Lotsy  (48)  rightly  insists  that:  "Certainty  of  purity  is  a  conditio 
sine  qua  non  to  obtain  proof  of  the  existence  of  mutation  in  living  beings,  just  as 
chemical  purity  is  a  conditio  sine  qua  non  to  obtain  proof  of  the  existence  of  mutability 
of  the  elements."  Is  there  certainty  of  purity  in  the  parental  strain  of  Botrytis 
cinerea  which  gave  rise  to  the  colourless  form  ?     Further,  is  the  fulfilment  of  such  a 
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criterion  possible  in  the  fungi  (or  bacteria),  and  particularly  in  those  forms  for  which 
mutation  is  claimed  ? 

The  only  positive  contribution  to  this  subject  is  the  work  of  Burgeff  (18), 
published  in  1914,  in  which  the  results  of  the  crossing  of  various  forms  of  Phycomyces 
nitens  are  described.  This  fungus  possesses  a  coenocytic  mycelium,  containing 
numerous  scattered  nuclei.  In  the  asexual  form  of  reproduction  spores  are  delimited 
within  sporangial  heads  into  which  there  have  passed  an  indefinite  number  of  nuclei. 
The  multinucleate  spores  germinate  and  reproduce  the  coenocytic  mycelium.  If, 
therefore,  the  original  hyphse  are  genetically  impure,  this  condition  will  be  maintained 
in  all  succeeding  generations,  for  the  sporangiospores  merely  reproduce  the  genetic 
condition  of  the  hyphse  which  give  rise  to  the  sporangia.  Opportunity  for  genetic 
contamination  occurs  at  sexual  reproduction,  for  this  process  is  merely  a  fusion  of  two 
multinucleate  gametes  to  form  a  multinucleate  zygospore,  which  on  germination  gives 
rise  to  a  coenocytic  mycelium  containing  nuclei  of  both  parental  strains.  There  will- 
be  an  equal  chance  for  both  types  of  nuclei  to  pass  into  the  sporangia  and  be  included 
in  the  multinucleate  spores.  A  single-spore  strain  may  thus  be  heterocaryotic.  If 
now  this  form  at  sexual  maturity  fuses  with  a  third  form,  and  so  on,  the  genotype  of 
any  particular  isolation  may  be  extremely  complex.  As,  moreover,  the  sporangio- 
spores are  delimited  and  the  walls  of  the  zygogametes  laid  down  without  any 
apparent  regard  either  to  the  condition,  the  number,  or  the  position  of  the  nuclei 
they  separate,  there  is  no  absolute  surety  that  two  single-spore  strains  derived  from 
an  original  single-spore  strain  will  have  the  same  genotype. 

In  such  Mucorineca  it  is,  therefore,  totally  impossible  to  comply  with  the  criterion 
of  specific  purity  which  is  imperative  in  this  issue.  There  is  an  almost  equal  lack  of 
certainty  of  specific  purity  in  the  genera  Aspergillus  and  Penicillium.  The  mycelium 
of  these  fungi  is  multicellular  and  each  cell  contains  many  nuclei.  On  asexual 
reproduction  a  vegetative  cell  gives  rise  to  a  conidiophore,  and  from  this  conidia  are 
abstricted.  The  hyphal  mother  cell  is  multinucleate  and  these  nuclei,  by  division  and 
wandering,  finally  pass  into  the  reproductive  cells,  these,  in  many  species  of  the  fungi, 
containing  an  indefinite  number  of  nuclei.  In  other  species  the  conidia  generally 
contain  but  a  single  nucleus,  but  almost  any  preparation  of  such  conidia  shows  multi- 
nucleate individuals. 

A  single-spore  strain  thus  reproduces  the  genotype  of  its  parent,  and  if  the  latter 
contains  nuclei  of  different  constitutions,  this  heterocaryotic  condition  will  be  main- 
tained in  the  progeny. 

Now,  in  the  case  of  Phycomyces,  genetic  contamination  occurs  in  the  sexual  process, 
for  many  of  these  forms  are  allogamous.  There  are,  however,  no  species  of 
Aspergillus  or  Penicillium  known  in  which  sexuality  is  necessarily  allogamous, 
although  there  is  no  certainty  that  this  type  of  reproduction  may  not  occur,  at  all 
events,  at  intervals.  We  still  know  too  little  of  what  takes  place  in  the  sexual 
processes  of  either  of  these  genera  to  make  their  discussion  profitable  ;  but  given  a 
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imioii   of    two   parental    niycelia   only  once   in   many  thousands  of  generations,  the 
inycelia  of  the  offspring  can  afford  no  certainty  of  specific  purity. 

Moreover  in  any  culture  of  a  single-spore  strain  of  either  fungus  it  is  not  usually 
difficult  to  find  two  hypha3  which  have  come  into  contact,  and  the  walls  separating 
them  have  been  auto-digested,  giving  cytoplasmic  continuity.  In  a  mixed  growth  of 
a  number  of  strains  these  somatic  fusions  are  also  to  be  found,  and  although  one  is  not 
able  to  state  that  fusions  occur  between  hyphal  cells  of  different  strains,  neither  may 
the  rare  possibility  of  this  be  denied.  If  now  from  such  a  contaminated  cell  in 
Aspergillus  or  Penicilliurn  a  conidiophore  were  to  arise,  or  either  or  both  the  sexual 
organs,  the  offspring  of  the  reproductive  bodies  would  be  genetically  impure,  and  asexual 
reproduction  would  only  maintain  this  heterocaryotic  or  heterozygotic  condition. 

Thus  even  accepting  the  evidence  of  Arcighovskij  (2),  Schiemann  (69),  and 
Waterman  (83),  it  by  no  means  foUows  that  the  permanent  changes  they  observed 
are  necessarily  mutations,  for  they  may  be  interpreted  in  terms  of  genetic  contamina- 
tion, and  in  no  case  may  these  organisms  comply  with  that  criterion  of  "  certainty  of 
sjjecific  purity  which  is  a  conditio  sine  qua  non  to  obtain  proof  of  the  existence  of 
mutation."  Although  in  this  account  very  little  reference  has  been  made  to  bacterial 
mutation,  there  can  be  little  doubt  that  the  possibility  of  the  rare  occurrence  of 
"somatic"  fusiong  in  these  organisms  places  them  out  of  consideration  as  subject 
organisms  for  the  experimental  induction  of  mutations. 

Thus,  according  to  the  principle  of  parsimony,  all  so-called  mutatioTis  in  these 
groups  of  organisms  may  perhaps  more  wisely  be  interpreted  in  terms  of  the  splitting 
of  an  originally  impure  genetic  constitution  or  of  gametic  or  somatic  segregation 
from  heterozygotes. 

These  considerations  applied  to  the  fungus  Botrytis  cinerea  throw  an  entirely  new 
light  upon  the  value  to  be  attached  to  the  colourless  strain.  Botrytis  cinerea 
possesses  a  multicellular  mycelium,  each  cell  of  which  contains  many  nuclei.  The 
conidiophores  are  multinucleate  and  a  small  but  indefinite  number  of  nuclei  pass  into 
each  conidium.  Throughout  the  whole  of  the  vegetative  and  reproductive  mycelium 
the  septa  are  laid  down  by  a  diaphragm-like  gi'owth  from  the  hyphal  walls,  irre- 
spective of  the  number  or  condition  of  the  nuclei  thus  separated.  Each  conidium, 
therefore,  merely  reproduces  the  genetic  constitution  of  the  original  cell  of  the 
mycelium  in  which  its  conidiophore  arose.  Many  thousands  of  such  asexual 
generations  would,  therefore,  not  alter  the  genetic  constitution  of  the  organism,  and 
there  is  no  sexual  process.  The  possibility  of  genetic  contamination  is  brought  about 
by  the  occurrence  of  hyphal  anastomoses.  In  the  extremely  rare  chance  of  a  fertile 
conidiophore  arising  from  a  cell  contaminated  by  the  nuclei  or  cytoplasm  of  a  geno- 
typically  different  individual  lies,  I  believe,  the  explanation  of  the  colourless  form  of 
Botrytis  described.  While  this  possibility  exists,  it  is  more  consonant  with  the 
principles  of  scientific  methodology  to  accept  this  interpretation  than  to  formulate  a 
mutational  hypothesis  of  the  origin  of  the  aberrant  strain. 
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Before  concluding,  a  few  words  may  perhaps  be  said  concerning  the  role  of  the 
nucleus  in  the  heredity  of  multinucleate  fungi.  The  conidia  of  Aspergillus,  Pem- 
cillium  OT  Botrytis  contain  a  variable  number  of  nuclei,  and  yet  if  conidia  are  plated 
out  the  colonies  which  develop  are  essentially  similar  in  physiological  reactions  and 
morphological  properties  for  each  particular  species.  It  follows,  therefore,  that  the 
exact  number  of  nuclei  which  chance  to  be  enclosed  in  each  reproductive  body  has  no 
bearing  (m  the  absence  or  presence  of  particular  characters  in  the  progeny,  but  is  a 
purely  fortuitous  occurrence.  What  is  vital,  is,  as  Burgbff  (18)  has  shown,  not  the 
number  of  nuclei  in  each  cell  but  the  kind  of  nuclei  present.  Although  the  evidence 
on  this  point  is  meagre  there  would  yet  seem  no  reason  to  doubt  that  the  structure  of 
the  nucleus  in  fungi  is  in  general  fundamentally  similar  to  that  in  other  organisms. 

Following  Morgan  (55)  the  chromosomes  might  thus  be  visualised  as  linear  series 
of  loci,  like  threaded  beads.  At  each  locus  exist  specific  factors,  making  up  a 
reaction  system  the  elements  of  which  bear  a  more  or  less  specific  relationship  to  one 
another.  Thus,  eight  loci,  representing  the  factor  for  black  sclerotial  pigment  in  the 
chromosomes  of  as  many  identical  nuclei,  would  therefore  have  neither  greater  nor  less 
visible  expression  than  one  locus  in  the  chromosome  of  a  single  nucleus,  or  than  15 
loci  in  the  chromosomes  of  1 5  nuclei  in  a  single  reproductive  body.  Increased  number 
of  nuclei  does  not  signify  a  cumulative  value  for  any  particular  .character.  But  if 
owing  to  somatic  fusion  a  hyphal  cell  containing  eight  nuclei  with  loci  for  black 
pigment  became  possessed  of  one  or  more  contaminating  nuclei  from  a  genotypically 
different  individual,  and  if  this  particular  cell  gave  rise  to  a  sclerotium,  the  characters 
of  this  sclerotium  would  be  determined  by  the  reacting  systems  of  loci  present  in  the 
two  kinds  of  nuclei,  the  visible  characters  being  the  expression  of  the  resultant  of 
their  interaction.  The  contaminating  nuclei  might  find  themselves  in  a  cytoplasmic 
environment  so  foreign  to  their  normal  metabolism  as  to  remain  inert,  or  the  locus  for 
colour  might  be  counterbalanced  or  cancelled  by  the  locus  for  black  in  any  one  of  the 
eight  original  nuclei,  in  which  case  the  seven  remaining  nuclei  would  reproduce  the 
original  colour.  In  both  cases  the  sclerotial  characters  would  be  wholly  normal.  On 
the  other  hand  the  loci-  in  the  invading  nuclei  might  function  as  lethal  factors,  such 
as  Morgan  (54)  has  shown  to  exist  in  the  case  oi  Drosophila,  and  development  might 
be  partially  or  entirely  inhibited. 

Again,  the  factors  in  the  one  system  might  react  in  many  ways  with  those  of  the 
other  system  changing  the  physiological  and  morphological  characters  of  the  adult 
form  so  that  apparently  spontaneous  variation  in  several  independent  directions  might 
result,  producing  "mutants"  such  as  those  described  by  Argiohovsku,  Schiemann 
and  Waterman.  The  majority  of  the  factors  of  the  contaminating  nuclei  might 
remain  inert,  but  one  or  more  might  react  with  certain  factors  in  the  original  nuclei 
resulting  in  the  inhibition  of  one  or  moi'e  of  the  latter.  If,  for  example,  the  factor 
for  chromogen  were  thus  inhibited  or  destroyed  the  adult  would  be  an  albino  similar 
to    the    new   form    of  Botri/tis  cinerea  described.     If  there  are  several  factors  for 
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colour,  as  would  appear  probable  in  the  present  instance,  that  associated  with  sclerotial 
pigmentation  only  might  be  inhibited,  leaving  unaltered  the  factors  governing  the 
pigmentation  of  spores  or  haptera. 

On  the  other  hand,  it  is  not  improbable  that  the  contamination  may  be  purely 
cytoplasmic,  and  the  progeny  of  such  a  cell  will  therefore  still  be  homocaryotic.  The 
foreign  cytoplasm,  however,  would  create  a  new  environment  through  which  the 
nuclei  would  function,  and  this  might  so  influence  the  development  of  the  organisms 
as  to  alter  materially  the  outward  expression  of  one  or  more  characters  in  the  adult 
form.  As  the  added  cytoplasm  would  now  be  an  integral  portion  of  the  organism, 
this  changed  expression  would  be  permanent  and  heritable.  It  is  thus  possible  for 
apparent  "  mutations  "  to  occur  in  the  fungi  without  any  change  in  nuclear  character. 
So  far  as  the  nucleus  is  concerned,  regarding  this  structure  as  an  entity  distinct  from 
the  cytoplasm,  the  change  would  merely  be  phenotypic ;  but  for  the  complete 
organism  the  change  would  be  genotypic.  It  is  perhaps  in  such  purely  cytoplasmic 
contammations  that  apparent  mutations  in  those  organisms  having  uninucleate  cells 
may  be  sought. 

In  the  case  under  discussion,  nuclear  and  cytoplasmic  contamination  must  have 
occurred  prior  to  the  isolation  of  the  single-spore  strain,  and  yet  the  colourless  strain 
arose  only  in  the  third  tube-generation  subsequent  to  this.  If,  however,  one  assumes 
a  contaminated  original  strain,  it  is  not  inconceivable  that  in  the  fortuitous  septation 
of  the  hyphee,  certain  nuclei  and  cytoplasmic  matter  were  separated,  such  that  in  all 
things  save  chromogen  formation  the  resultant  resembled  the  parental  strain.  The 
expression  of  this  resultant  would  be  an  albino  strain  such  as  that  described.  Such 
a  concomitance  of  conditions  might  well  appear  almost  inconceivable,  but  the  appear- 
ance of  an  albinistic  strain  of  the  fungus  is  equally  almost  inconceivable,  for  in  the 
history  of  the  genus  as  we  know  it  it  has  occurred  but  once. 

It  would  seem  to  me  probable  that  the  same  concept  which  Bateson  (8)  and 
LoTSY  (48)  have  formulated  for  the  higher  organisms  may  hold  true  for  both  fungi 
and  bacteria,  but  here  the  recombination  of  genes  may  not  only  take  place  by  sexual 
fusions,  but  also  by  direct  "  contamination  "  of  one  species  by  the  nuclei  and  cytoplasm 
of  another  in  vegetative  anastomoses.  New  reacting  systems  of  chromosomal  loci 
impinge  upon  each  other,  and  the  resultant  is  the  new  form.  This  at  first  may  be 
heterocaryotic,  but  if  true  sexual  processes  occur  later,  the  offspring  become  homo- 
caryotic but  heterozygous.  Such  a  hypothesis  fits  the  facts  and  explains  them  as  I 
think  no  other  present  hypothesis  does. 

It  is  not  the  possibility  of  mutation  in  the  fungi  (and  bacteria)  that  is  here  denied, 
but  rather  the  compulsion  to  accept  that  interpretation  of  the  evidence  as  it  now 
stands.  As  the  facts  present  themselves  to  me,  avenues  of  interpretation  other  than 
that  of  mutation  are  still  widely  open. 
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DESCRIPTION    OF    PLATE. 

Fig.  1. — Petri  dish  culture  in  which  the  normal  form  and  its  colourless  derivative 
have  been  grown.  Pigmented  haptera  of  the  albino  strain  may  be  seen 
at  the  periphery  of  the  culture  in  contact  with  the  glass.      X  f- 

Fig.  2. — Vertical  section  of  colourless  sclerotium.      X  250. 

Fig.  3. — Vertical  section  of  parental  sclerotium.      X  250. 
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Definitions. 

Centre  of  Location. — That  abscissa  of  a  frequency  curve  for  which  the  sampling  errors 
of  optimum  location  are  uncorrelated  with  those  of  optimum  scaling.     (9.) 

Consistency. — A  statistic  satisfies  the  criterion  of  consistency,  if,  when  it  is  calculated 
from  the  whole  population,  it  is  equal  to  the  required  parameter.     (4.) 

Distribution. — Problems  of  distribution  are  those  in  which  it  is  required  to  calculate 
the  distribution  of  one,  or  the  simultaneous  distribution  of  a  number,  of  functions  of 
quantities  cEstributed  in  a  known  manner.     (.3.) 

Efficiency. — The  efficiency  of  a  statistic  is  the  ratio  (usually  expressed  as  a  percentage) 
which  its  intrinsic  accuracy  bears  to  that  of  the  most  efficient  statistic  possible.     It 
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expresses  the  proportion  of  the  total  available  relevant  information  of  which  that 
statistic  makes  use.     (4  and  10.) 

Efficiency  {Criterion).— The  criterion  of  efficiency  is  satisfied  by  those  statistics  which, 
when  derived  from  large  samples,  tend  to  a  normal  distribution  wth  the  least  possible 
standard  deviation.     (4.) 

Estitnation.—PioUems  of  estimation  are  those  in  which  it  is  required  to  estimate  the 
value  of  one  or  more  of  the  population  parameters  from  a  random  sample  of  the 
population.     (3.) 

Intrinsic  Accuracy. —The  intrinsic  accuracy  of  an  error  curve  is  the  weight  in  large 
samples,  divided  by  the  number  in  the  sample,  of  that  statistic  of  location  which  satisfies 
the  criterion  of  sufficiency.     (9.) 

Isostatistical  Regions.— 1{  each  sample  be  represented  in  a  generalized  space  of  which 
the  observations  are  the  co-ordinates,  then  any  region  throughout  which  any  set  of 
statistics  have  identical  values  is  termed  an  isostatistical  region. 

Likelihood.— The  likelihood  that  any  parameter  (or  set  of  parameters)  should  have 
any  assigned  value  (or  set  of  values)  is  jiroportional  to  the  probability  that  if  this  were 
so,  the  totality  of  observations  should  be  that  observed. 

Location. — The  location  of  a  frequency  distribution  of  known  form  and  scale  is  the 
process  of  estimation  of  its  position  with  respect  to  each  of  the  several  variates.     (8.) 

Optimum. — The  optimum  value  of  any  parameter  (or  set  of  parameters)  is  that  value 
(or  set  of  values)  of  which  the  likelihood  is  greatest.     (6.) 

Scaling.— The  scaling  of  a  frequency  distribution  of  known  form  is  the  process  of 
estimation  of  the  magnitudes  of  the  deviations  of  each  of  the  several  variates.     (8.) 

Specification. — Problems  of  specification  are  those  in  which  it  is  required  to  specify 
the  mathematical  form  of  the  distribution  of  the  hypothetical  population  from  which 
a  sample  is  to  be  regarded  as  drawn.     (3.) 

Sufficiency. — A  statistic  satisfies  the  criterion  of  sufficiency  when  no  other  statistic 
which  can  be  calculated  from  the  same  sample  provides  any  additional  information  as 
to  the  value  of  the  parameter  to  be  estimated.     (4.) 

Validity. — The  region  of  validity  of  a  statistic  is  the  region  comprised  within  its 
contour  of  zero  efficiency.     (10.) 

1.  The  Neglect  of  Theoretical  Statistics. 

Several  reasons  have  contributed  to  the  prolonged  neglect  into  which  the  study  of 
statistics,  in  its  theoretical  aspects,  has  fallen.  In  spite  of  the  immense  amount  of 
fruitful  labour  wliich  has  been  expended  in  its  practical  applications,  the  basic  principles 
of  this  organ  of  science  are  stUl  in  a  state  of  obscurity,  and  it  cannot  be  denied  that, 
during  the  recent  rapid  development  of  practical  methods,  fundamental  problems  have 
been  ignored  and  fundamental  paradoxes  left  unresolved.  This  anomalous  state  of 
statistical  science  is  strikingly  exemplified  by  a  recent  paper  (1)  entitled  "  The  Funda- 
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mental  Problem  of  Practical  Statistics,"  in  which  one  of  the  most  eminent  of  modern 
statisticians  presents  what  purports  to  be  a  general  proof  of  Bayes'  postulate,  a  proof 
which,  in  the  opinion  of  a  second  statistician  of  equal  eminence,  "  seems  to  rest  upon  a 
very  peculiar — not  to  say  hardly  supposable — relation."     (2.) 

Leaving  aside  the  specific  question  here  cited,  to  which  we  shall  recur,  the  obscurity 
which  envelops  the  theoretical  bases  of  statistical  methods  may  perhaps  be  ascribed 
to  two  considerations.  In  the  first  place,  it  appears  to  be  widely  thought,  or  rather 
felt,  that  in  a  subject  in  which  all  results  are  liable  to  greater  or  smaller  errors,  precise 
definition  of  ideas  or  concepts  is,  if  not  impossible,  at  least  not  a  practical  necessity. 
In  the  second  place,  it  has  happened  that  in  statistics  a  purely  verbal  confusion  has 
hindered  the  distinct  formulation  of  statistical  problems  ;  for  it  is  customary  to  apply 
the  same  name,  mean,  standard  deviation,  correlation  coefficient,  etc.,  both  to  the  true, 
value  which  we  should  like  to  know,  but  can  only  estimate,  and  to  the  particular  value 
at  which  we  happen  to  arrive  by  our  methods  of  estimation  ;  so  also  in  applying  the 
term  probable  error,  writers  sometimes  would  appear  to  suggest  that  the  former  quantity, 
and  not  merely  the  latter,  is  subject  to  error. 

It  is  this  last  confusion,  in  the  writer's  opinion,  more  than  any  other,  which  has  led 
to  the  survival  to  the  present  day  of  the  fundamental  paradox  of  inverse  probability, 
which  like  an  impenetrable  jungle  arrests  progress  towards  precision  of  statistical 
concepts.  The  criticisms  of  Boole,  Venn,  and  Chrystal  have  done  something  towards 
banishing  the  method,  at  least  from  the  elementary  text-books  of  Algebra  ;  but  though 
we  may  agree  wholly  with  Chrystal  that  inverse  probability  is  a  mistake  (perhaps  the 
only  mistake  to  which  the  mathematical  world  has  so  deeply  committed  itself),  there 
yet  remains  the  feeling  that  such  a  mistake  would  not  have  captivated  the  minds  of 
Laplace  and  Poisson  if  there  had  been  nothing  in  it  but  error. 

2.  The  Purpose  of  Statistical  Methods. 

In  order  to  arrive  at  a  distinct  formulation  of  statistical  problems,  it  is  necessary  to 
define  the  task  which  the  statistician  sets  himself :  briefly,  and  in  its  most  concrete 
form,  the  object  of  statistical  methods  is  the  reduction  of  data.  A  quantity  of  data, 
which  usually  by  its  mere  bulk  is  incapable  of  entering  the  mind,  is  to  be  replaced  by 
relatively  few  quantities  which  shall  adequately  represent  the  whole,  or  which,  in  other 
words,  shall  contain  as  much  as  possible,  ideally  the  whole,  of  the  relevant  information 
contained  in  the  original  data. 

This  object  is  accomplished  by  constructing  a  hypothetical  infinite  population,  of 
which  the  actual  data  are  regarded  as  constituting  a  random  sample.  The  law  of  distri- 
bution of  this  hypothetical  popidation  is  specified  by  relatively  few  parameters,  which 
are  sufiicient  to  describe  it  exhaustively  in  respect  of  all  qualities  under  discussion. 
Any  information  given  by  the  sample,  which  is  of  use  in  estimating  the  values  of  these 
parameters,  is  relevant  information.     Since  the  number  of  independent  facts  supplied  in 
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the  data  is  usually  far  greater  than  the  nuraber  of  facts  sought,  much  of  the  information 
supplied  by  any  actual  sample  is  irrelevant.  It  is  the  object  of  the  statistical  processes 
employed  in  the  reduction  of  data  to  exclude  this  irrelevant  information,  and  to  isolate 
the  whole  of  the  relevant  information  contained  in  the  data. 

When  we  speak  of  the  probability  of  a  certain  object  fulfilling  a  certain  condition,  we 
imagine  all  such  objects  to  be  divided  into  two  classes,  according  as  they  do  or  do  not 
fulfil  the  condition.  This  is  the  only  characteristic  in  them  of  which  we  take  cognisance. 
For  this  reason  probability  is  the  most  elementary  of  statistical  concepts.  It  is  a  para- 
meter which  specifies  a  simple  dichotomy  in  an  infinite  hypothetical  population,  and  it 
represents  neither  more  nor  less  than  the  frequency  ratio  which  we  imagine  such  a 
population  to  exhibit.  For  example,  when  we  say  that  the  probability  of  throwing  a 
five  ^¥ith  a  die  is  one-sixth,  we  must  not  be  taken  to  mean  that  of  any  six  throws  with 
that  die  one  and  one  only  ^vill  necessarily  be  a  five  ;  or  that  of  any  six  million 
throws,  exactly  one  million  will  be  fives  ;  but  that  of  a  hypothetical  population  of  an 
infinite  number  of  throws,  with  the  die  in  its  original  condition,  exactly  one-sixth  will 
be  fives.  Our  statement  wUl  not  then  contain  any  false  assumption  about  the  actual 
die,  as  that  it  will  not  wear  out  with  continued  use,  or  any  notion  of  approximation,  as 
in  estimating  the  probability  from  a  finite  sample,  although  this  notion  may  be  logically 
developed  once  the  meaning  of  probability  is  apprehended. 

The  concept  of  a  discontinuous  frequency  distribution  is  merely  an  extension  of  that  of 
a  simple  dichotomy,  for  though  the  number  of  classes  into  which  the  population  is 
divided  may  be  infinite,  yet  the  frequency  in  each  class  bears  a  finite  ratio  to  that  of  the 
whole  population.  In  frequency  curves,  however,  a  second  infinity  is  introduced.  No 
finite  sample  has  a  frequency  curve  :  a  finite  sample  may  be  represented  by  a  histogram, 
or  by  a  frequency  polygon,  which  to  the  eye  more  and  more  resembles  a  curve,  as  the 
size  of  the  sample  is  increased.  To  reach  a  true  curve,  not  only  would  an  infinite  number 
of  individuals  have  to  be  placed  in  each  class,  but  the  number  of  classes  (arrays)  into 
which  the  population  is  divided  must  be  made  infinite.  Consequently,  it  should  be 
clear  that  the  concept  of  a  frequency  curve  includes  that  of  a  hypothetical  infinite 
population,  distributed  according  to  a  mathematical  law,  represented  by  the  curve. 
This  law  is  specified  by  assigning  to  each  element  of  the  abscissa  the  corresponding 
element  of  probability.  Thus,  in  the  case  of  the  normal  distribution,  the  probability 
of  an  observation  falling  in  the  range  dx,  is 

1  _  (x-m)- 

e     '""■   dx. 


v'z. 


T^/9.1 


in  which  expression  x  is  the  value  of  the  variate,  while  m,  the  mean,  and  a,  the  standard 
deviation,  are  the  two  parameters  by  which  the  hypothetical  population  is  specified. 
If  a  sample  of  n  be  taken  from  such  a  popidation,  the  data  comprise  n  independent  facts. 
The  statistical  process  of  the  reduction  of  these  data  is  designed  to  extract  from  them 
all  relevant  information  respecting  the  values  of  m  and  a-,  and  to  reject  all  other 
information  as  irrelevant. 
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It  should  be  noted  that  there  is  no  falsehood  in  interpreting  any  set  of  independent 
measurements  as  a  random  sample  from  an  infinite  population  ;  foi  any  such  set  of 
numbers  are  a  random  sample  from  the  totality  of  numbers  produced  by  the  same 
matrix  of  causal  conditions  :  the  hypothetical  population  which  we  are  studying  is  an 
aspect  of  the  totality  of  the  effects  of  these  conditions,  of  whatever  nature  they  may  be. 
The  postulate  of  randomness  thus  resolves  itself  into  the  question,  "  Of  what  population 
is  this  a  random  sample  ?  "  which  must  frequently  be  asked  by  every  practical  statistician. 

It  will  be  seen  from  the  above  examples  that  the  process  of  the  reduction  of  data  is, 
even  in  the  simplest  cases,  performed  by  interpreting  the  available  observations  as  a 
sample  from  a  hypothetical  infinite  population  ;  this  is  a  fortiori  the  case  when  we  have 
more  than  one  variate,  as  when  we  are  seeking  the  values  of  coefficients  of  correlation. 
There  is  one  point,  however,  which  may  be  briefly  mentioned  here  in  advance,  as  it 
has  been  the  cause  of  some  confusion.  In  the  example  of  the  frequency  curve  mentioned 
above,  we  took  it  for  granted  that  the  values  of  both  the  mean  and  the  standard  deviation 
of  the  population  were  relevant  to  the  inquiry.  This  is  often  the  case,  but  it  sometimes 
happens  that  only  one  of  these  quantities,  for  example  the  standard  deviation,  is  required 
for  discussion.  In  the  same  way  an  infinite  normal  popidation  of  two  correlated  variates 
will  usually  require  five  parameters  for  its  specification,  the  two  means,  the  two  standard 
deviations,  and  the  correlation  ;  of  these  often  only  the  correlation  is  required,  or  if  not 
alone  of  interest,  it  is  discussed  without  reference  to  the  other  four  quantities.  In  such 
cases  an  alteration  has  been  made  in  what  is,  and  what  is  not,  relevant,  and  it  is  not 
surprising  that  certain  small  corrections  should  appear,  or  not,  according  as  the  other 
parameters  of  the  hypothetical  surface  are  or  are  not  deemed  relevant.  Even  more 
clearly  is  this  discrepancy  shown  when,  as  in  the  treatment  of  such  fourfold  tables  as 
exhibit  the  recovery  from  smallpox  of  vaccinated  and  unvaccinated  patients,  the  method 
of  one  school  of  statisticians  treats  the  proportion  of  vaccinated  as  relevant,  while 
others  dismiss  it  as  irrelevant  to  the  inquiry.     (3.) 

3.  The  Problems  of  Statistics. 

The  problems  which  arise  in  reduction  of  data  may  be  conveniently  divided  into  three 
types  :— 

(1)  Problems  of  Specification.     These  arise  in  the  choice  of  the  mathematical  form  of 

the  population. 

(2)  Problems  of  Estimation.     These  involve  the  choice  of  methods  of  calculating  from 

a  sample  statistical  derivates,  or  as  we  shall  call  them  statistics,  which  are  designed 
to  estimate  the  values  of  the  parameters  of  the  hypothetical  population. 

(3)  Problems    of    Distribution.      These   include  discussions    of   the    distribution    of 

statistics  derived  from  samples,  or  in  general  any  functions  of  quantities  whose 
distribution  is  known. 

It  will  be  clear  that  when  we  know  (1)  what  parameters  are  required  to  specify  the 
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population  from  which  the  sample  is  drawn,  (2)  how  best  to  calculate  from  the  sample 
estimates  of  these  parameters,  and  (3)  the  exact  form  of  the  distribution,  in  different 
samples,  of  our  derived  statistics,  then  the  theoretical  aspect  of  the  treatment  of  any- 
particular  body  of  data  has  been  completely  elucidated. 

As  regards  problems  of  specification,  these  are  entirely  a  matter  for  the  practical 
statistician,  for  those  cases  where  the  qualitative  nature  of  the  hypothetical  population 
is  known  do  not  involve  any  problems  of  this  type.  In  other  cases  we  may  know  by 
experience  what  forms  are  likely  to  be  suitable,  and  the  adequacy  of  our  choice  may 
be  tested  a  posteriori.  We  must  confine  ourselves  to  those  forms  which  we  know  how 
to  handle,  or  for  which  any  tables  which  may  be  necessary  have  been  constructed. 
More  or  less  elaborate  forms  will  be  suitable  according  to  the  volume  of  the  data. 
Evidently  these  are  considerations  the  nature  of  which  may  change  greatly  during  the 
work  of  a  single  generation.  We  may  instance  the  development  by  Pearson  of  a  very 
extensive  system  of  skew  curves,  the  elaboration  of  a  method  of  calculating  their  para- 
meters, and  the  preparation  of  the  necessary  tables,  a  body  of  work  which  has  enormously 
extended  the  power  of  modern  statistical  practice,  and  which  has  been,  by  pertinacity 
and  inspiration  alike,  practically  the  work  of  a  single  man.  Nor  is  the  introduction  of 
the  Pearsonian  system  of  frequency  curves  the  only  contribution  which  their  author  lias 
made  to  the  solution  of  problems  of  specification  :  of  even  greater  importance  is  the 
introduction  of  an  objective  criterion  of  goodness  of  fit.  For  empirical  as  the  specifica- 
tion of  the  hypothetical  population  may  be,  this  empiricism  is  cleared  of  its  dangers  if 
we  can  apply  a  rigorous  and  objective  test  of  the  adequacy  with  which  the  proposed 
population  represents  the  whole  of  the  available  facts.  Once  a  statistic,  suitable  for 
applying  such  a  test,  has  been  chosen,  the  exact  form  of  its  distribution  in  random 
samples  must  be  investigated,  in  order  that  we  may  evaluate  the  probability  that  a 
worse  fit  should  be  obtained  from  a  random  sample  of  a  population  of  the  type  con- 
sidered. The  possibility  of  developing  complete  and  self-contained  tests  of  goodness  of 
fit  deserves  very  careful  consideration,  since  therein  lies  our  justification  for  the  free 
use  which  is  made  of  empirical  frequency  formulae.  Problems  of  distribution  of  great 
mathematical  difficulty  have  to  be  faced  in  this  direction. 

Although  problems  of  estimation  and  of  distribution  may  be  studied  separately,  they 
are  intimately  related  in  the  development  of  statistical  methods.  Logically  problems  of 
distribution  should  have  prior  consideration,  for  the  study  of  the  random  distribution  of 
different  suggested  statistics,  derived  from  samples  of  a  given  size,  must  guide  us  in  the 
clioice  of  which  statistic  it  is  most  profitable  to  calculate.  The  fact  is,  however,  that 
very  little  progress  has  been  made  in  the  study  of  the  distribution  of  statistics  derived 
from  samples.  In  1900  Pearson  (15)  gave  the  exact  form  of  the  distribution  of  x",  tlie 
Pearsonian  test  of  goodness  of  fit,  and  in  1915  the  same  author  published  (18)  a  similar 
result  of  more  general  scope,  valid  when  the  observations  are  regarded  as  subject  to 
linear  constraints.  By  an  easy  adaptation  (17)  the  tables  of  probability  derived  from 
this  formula  may  be  made  available  for  the  more  numerous  cases  in  which  linear  con- 
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straints  are  imposed  upon  the  hypothetical  population  by  the  means  which  we  employ 
in  its  reconstruction.  The  distribution  of  the  mean  of  samples  of  n  from  a  normal 
population  has  long  been  known,  but  in  1908  "  Student "  (4)  broke  new  ground  by 
calculating  the  distribution  of  the  ratio  which  the  deviation  of  the  mean  from  its  popula- 
tion value  bears  to  the  standard  deviation  calculated  from  the  sample.  At  the  same 
time  he  gave  the  exact  form  of  the  distribution  in  samples  of  the  standard  deviation. 
In  1915  Fisher  (5)  published  the  curve  of  distribution  of  the  correlation  coefficient  for 
the  standard  method  of  calculation,  and  in  1921  (6)  he  published  the  corresponding 
series  of  curves  for  intraclass  correlations.  The  brevity  of  this  list  is  emphasised  by  the 
absence  of  investigation  of  other  important  statistics,  such  as  the  regression  coefficients, 
multiple  correlations,  and  the  correlation  ratio.  A  formula  for  the  probable  error  of  any 
statistic  is,  of  course,  a  practical  necessity,  if  that  statistic  is  to  be  of  service  :  and  in 
the  majority  of  cases  such  formulae  have  been  found,  chiefly  by  the  labours  of  Pearson 
and  his  school,  by  a  first  approximation,  which  describes  the  distribution  with  sufficient 
accuracy  if  the  sample  is  sufficiently  large.  Problems  of  distribution,  other  than  the 
distribution  of  statistics,  used  to  be  not  uncommon  as  examination  problenis  in  proba- 
bility, and  the  physical  importance  of  problems  of  this  type  may  be  exemplified  by  the 
chemical  laws  of  mass  action,  by  the  statistical  mechanics  of  Gibbs,  developed  by 
Jeans  in  its  application  to  the  theory  of  gases,  by  the  electron  theory  of  Lorentz,  and 
by  Planck's  development  of  the  theory  of  quanta,  although  in  all  these  appli- 
cations the  methods  employed  have  been,  from  the  statistical  point  of  view,  relatively 
simple. 

The  discussions  of  theoretical  statistics  may  be  regarded  as  alternating  between 
problems  of  estimation  and  problems  of  distribution.  In  the  first  place  a  method  of 
calculating  one  of  the  population  parameters  is  devised  from  common-sense  considera- 
tions :  we  next  require  to  know  its  probable  error,  and  therefore  an  approximate  solution 
of  the  distribution,  in  samples,  of  the  statistic  calculated.  It  may  then  become  apparent 
that  other  statistics  may  be  used  as  estimates  of  the  same  parameter.  When  the 
probable  errors  of  these  statistics  are  compared,  it  is  usually  found  that,  in  large  samples, 
one  particular  method  of  calculation  gives  a  result  less  subject  to  random  errors  than 
those  given  by  other  methods  of  calculation.  Attacking  the  problem  more  thoroughly, 
and  calculating  the  surface  of  distribution  of  any  two  statistics,  we  may  find  that  the 
whole  of  the  relevant  information  contained  in  one  is  contained  in  the  other  :  or,  in 
other  words,  that  when  once  we  know  the  other,  knowledge  of  the  first  gives  us  no 
further  information  as  to  the  value  of  the  parameter.  Finally  it  may  be  possible. to 
prove,  as  in  the  case  of  the  Mean  Square  Error,  derived  from  a  sample  of  normal  popula- 
tion (7),  that  a  particular  statistic  summarises  the  whole  of  the  information  relevant 
to  the  corresponding  parameter,  which  the  sample  contains.  In  such  a  case  the  problem 
of  estimation  is  completely  solved. 
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4.  Criteria  of  Estimation. 

The  common-sense  criterion  employed  in  problems  of  estimation  may  be  stated  thus  :— 
That  when  applied  to  the  whole  population  the  derived  statistic  should  be  equal  to  the 
parameter.  This  may  be  called  the  Criterion  of  Consistency.  It  is  often  the  only  test 
applied  :  thus,  in  estimating  the  standard  deviation  of  a  normally  distributed  population, 
from  an  ungrouped  sample,  either  of  the  two  statistics — 

cTi  =  -  A,/-  S  ( I  a;-.?  I )  (Mean  error) 

and 

0-,  =   a/  -  S  {x-J'Y  (Mean  square  error) 

will  lead  to  the  correct  value,  a-,  when  calculated  from  the  whole  population.  They  both 
thus  satisfy  the  criterion  of  consistency,  and  this  has  led  many  computers  to  use  the 
first  formula,  although  the  result  of  the  second  has  14  per  cent,  greater  weight  (7),  and 
the  labour  of  increasing  the  number  of  observations  by  14  per  cent,  can  seldom  be  less 
than  that  of  applying  the  more  accurate  formula. 

Consideration  of  the  above  example  will  suggest  a  second  criterion,  namely  : — That  in 
large  samples,  when  the  distributions  of  the  statistics  tend  to  normality,  that  statistic 
is  to  be  chosen  which  has  the  least  probable  error. 

This  may  be  called  the  Criterion  of  Efficiency.  It  is  evident  that  if  for  large  samples 
one  statistic  has  a  probable  error  double  that  of  a  second,  while  both  are  proportional 
to  «--,  then  the  first  method  applied  to  a  sample  of  in  values  will  be  no  more  accurate 
than  the  second  applied  to  a  sample  of  any  n  values.  If  the  second  method  makes  use 
of  the  whole  of  the  information  available,  the  first  makes  use  of  only  one-quarter  of  it, 
and  its  efficiency  may  therefore  be  said  to  be  25  per  cent.  To  calculate  the  efficiency  of 
any  given  method,  we  must  therefore  know  the  probable  error  of  the  statistic  calculated 
by  that  method,  and  that  of  the  most  efficient  statistic  which  could  be  used.  The 
square  of  the  ratio  of  these  two  quantities  then  measures  the  efficiency. 

The  criterion  of  efficiency  is  still  to  some  extent  incomplete,  for  different 
methods  of  calculation  may  tend  to  agreement  for  large  samples,  and  yet  differ  for 
all  finite  samples.  The  complete  criterion  suggested  by  our  work  on  the  mean 
square  error  (7)  is  : — 

That  the  statistic  chosen  should  summarise  the  whole  of  the  relevant  information 
supplied  by  the  sample. 

This  may  be  called  the  Criterion  of  Sufficiency. 

In  mathematic-al  language  we  may  interpret  this  statement  by  saying  that  if  0  be 
the  parameter  to  be  estimated,  0,  a  statistic  which  contains  the  whole  of  the  information 
as  to  the  value  of  0,  which  the  sample  supplies,  and  Oj  any  other  statistic,  then  the 
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surface  of  distribution  of  pairs  of  values  of  O^  and  62,  for  a  given  value  of  0,  is  such  that 
for  a  given  value  of  0,,  the  distribution  of  6-,  does  not  involve  9.  In  other  words,  when 
0,  is  known,  knowledge  of  the  value  of  0.  throws  no  further  light  upon  the  value  of  0. 

It  may  be  shown  that  a  statistic  which  fulfils  the  criterion  of  sufficiency  will  also 
fulfil  the  criterion  of  efficiency,  when  the  latter  is  applicable.  For,  if  this  be  so,  the 
distribution  of  the  statistics  will  in  large  samples  be  normal,  the  standard  deviations 
being  proportional  to  n~K     Let  this  distribution  be 

df  = /        .,e   !-'•■' '-'.rr         2^1-        ■^■'■i'  ^  ddxde.., 

'ItvctiO-.,  v   1  — )'■ 


then  the  distribution  of  0,  is 


1  _«!-» 


so  that  for  a  given  value  of  Q^  the  distribution  of  Q.^  is 

(^2  v  liTT  V  1  —r 
and  if  this  does  not  involve  6,  we  must  have 

I  ■0-2  =  (Ti ; 

showing  that  o-j  is  necessarily  less  than  a--,,  and  that  the  efficiency  of  0..  is  measured  by 
;•-,  when  r  is  its  correlation  in  large  samples  with  61. 

Besides  this  case  we  shall  see  that  the  criterion  of  sufficiency  is  also  applicable  to  finite 
samples,  and  to  those  cases  when  the  weight  of  a  statistic  is  not  proportional  to  the 
number  of  the  sample  from  which  it  is  calcidated. 

5.  Examples  of  the  Use  of  the  Criterion  of  Consistency. 

In  certain  cases  the  criterion  of  consistency  is  sufficient  for  the  solution  of  problems 
of  estimation.  An  example  of  this  occurs  when  a  fourfold  table  is  interpreted  as  repre- 
senting the  double  dichotomy  of  a  normal  surface.  In  this  case  the  dichotomic  ratios 
of  the  two  variates,  together  with  the  correlation,  completely  specify  the  four  fractions 
into  which  the  population  is  divided.  If  these  are  equated  to  the  four  fractions  into 
which  the  sample  is  divided,  the  correlation  is  determined  uniquely. 

In  other  cases  where  a  small  correction  has  to  be  made,  the  amount  of  the  correction 
is  not  of  sufficient  importance  to  justify  any  great  refinement  in  estimation,  and  it  is 
sufficient  to  calculate  the  discrepancy  which  appears  when  the  uncorrected  method  is 
applied  to  the  whole  population.     Of  this  nature  is  Sheppard's  correction  for  grouping, 
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and  it  will  illustrate  this  use  of  the  criterion  of  consistency  if  we  derive  formulae  for 
this  correction  without  approximation. 

Let  ^  be  the  value  of  the  variate  at  the  mid  point  of  any  group,  a  the  interval  of 
grouping,  and  x  the  true  value  of  the  variate  at  any  point,  then  the  /c"'  moment  of  an 
infinite  grouped  sample  is 

in  which  oif(x)  dx  is  the  frequency,  in  any  element  dx,  of  the  ungrouped  population,  and 


t. 


p^-^ 


TT 


p  being  any  integer. 

Evidently  the  k"'  moment  is  periodic  in  6,  we  will  therefore  equate  it  to 

A(,  +  A,  sin  0  +  A.,  sin  2^. . . 

+  B,  cos0  +  B,,cos20.... 


Then 


K  =  ^    X   \   de\     tf{x)dx 

As  =  -    X         ^msOdB\        c'f(x)dx. 


B,  =  ^    X         cossOddl       ff{x)dx. 


But 


therefore 


0 

= 

277 

(I 

.^- 

27rp, 

do 

= 

27r 
a 

di. 

sin 

se 

= 

sin 

f-'f. 

cos 

sO 

= 

cos 

2^    . 

hence 
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Inserting  the  values  1,  2,  3  and  4  for  k,  we  obtain  for  the  aperiodic  terms  of  the  four 
moments  of  the  grouped  population 

iA„  =         xf{x)  dx, 

2A,  =  J    ^  (.X-+  Yr,j/(a-)  dx, 

A  =  J  ^  [x'+'^^fix)  dx. 

If  we  ignore  the  periodic  terms,  these  equations  lead  to  the  ordinary  Sheppard 
corrections  for  the  second  and  fourth  moment.  The  nature  of  the  approximation  involved 
is  brought  out  by  the  periodic  terms.  In  the  absence  of  high  contact  at  the  ends  of  the 
curve,  the  contribution  of  these  will,  of  course,  include  the  terms  given  in  a  recent  paper 
by  Pearson  (8)  ;  but  even  with  high  contact  it  is  of  interest  to  see  for  what  degree  of 
coarseness  of  grouping  the  periodic  terms  become  sensible. 

Now 

1     ''  =  *      Kir  c(+ia 

As  = -■    ^         shisede]       ff{x)dx, 
=  ^         sin^lZ^cZ^  ff{x)dx, 


But 
therefore 


=  ^\      fix)dx\        fsin^^^(i^. 

2  r+^''  ,   .     27r.s-^  ,-  a         -Zttsx 

(  sm      ^  df  = cos cos  -n-s, 

TT.S-  J  -  X  ((, 


similarly  the  other  terms  of  the  different  moments  may  be  calculated. 
For  a  normal  curve  referred  to  the  true  mean 

,A,  =  (-)-^?^e-^, 

iB,  =  0, 
in  which 

a  =  27re. 

The  error  of  the  mean  is  therefore 

4<r-  9iT-' 


/    -Si-  _li:  -^^'  \ 

-2e{e  ^''sinO-^e  -"' sin  20  +  ^e  -'- sin  30-... j. 


2  Y^2 
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To  illustrate  a  coarse  grouiiing,  take  the  group  interval  equal  to  the  standard  deviation 
then 


and  the  error  is 


(T 


-  -  e-'"'-  sin  6 

IT 


with  sufficient  accuracy.     The  standard  error  of  the  mean  being  — ^ ,  we  may  calculate 

V  It 

the  size  of  the  sample  for  which  the  error  due  to  the  periodic  terms  becomes  ecjual  to 
one-tenth  of  the  standard  error,  by  putting 


lU\/v(,        -^ 

whence 

u  =  ^^  e'"'  =  13,7'JO  billion. 
100 

For  the  second  moment 
and,  if  we  put  _ 

4(7  t  , 


there  results 

11  —  soxif'*"  =175  billion. 

The  error,  while  still  very  minute,  is  thus  more  important  for  the  second  than  for 
the  first  moment. 
For  the  third  moment 

6^*s  '         -'  -" 


putting 


\/l5o-'  ,     , 


n 


1 


'JGOtt 

While  for  the  fourth  moment 


n  =  — — r.e""  =  147  IjHI 


ion 


el.  fS  or  «  0- 


so  that,  if  we  put, 

\/96o-' 
10-//, 


4 


3         ,.-• 


n  = ic'"'  =  ro4  billion. 

32007r* 
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In  a  similar  manner  the  exact  form  of  Sheppard's  correction  may  be  found  for  other 
curves  ;  for  the  normal  curve  we  may  say  that  the  periodic  terms  are  exceedingly  minute 
so  long  as  a  is  less  than  o-,  though  they  increase  very  rapidly  if  a  is  increased  beyond 
this  point.  They  are  of  increasing  importance  as  higher  moments  are  used,  not  only 
absolutely,  but  relatively  to  the  increasing  probable  errors  of  the  higher  moments. 
The  principle  upon  which  the  correction  is  based  is  merely  to  find  the  error  when  the 
moments  are  calculated  from  an  infinite  grouped  sample  ;  the  corrected  moment  therefore 
fulfils  the  criterion  of  consistency,  and  so  long  as  the  correction  is  small  no  greater 
refinement  is  required. 

Perhaps  the  most  extended  use  of  the  criterion  of  consistency  has  been  developed  by 
Pearson  in  the  "  Method  of  Moments."  In  this  method,  which  is  without  question  of 
great  practical  utility,  different  forms  of  frequency  curves  are  fitted  by  calculating  as 
many  moments  of  the  sample  as  there  are  parameters  to  be  evaluated.  The  parameters 
chosen  are  those  of  an  infinite  population  of  the  specified  type  having  the  same  moments 
as  those  calculated  from  the  sample. 

The  system  of  curves  developed  by  Pearson  has  foi;r  variable  parameters,  and  may 
be  fitted  by  means  of  the  first  four  moments.  For  this  purpose  it  is  necessary  to  confine 
attention  to  curves  of  which  the  first  four  moments  are  finite  ;  further,  if  the  accuracy 
of  the  fourth  moment  shoidd  increase  with  the  size  of  the  sample,  that  is,  if  its  probable 
error  should  not  be  infinitely  great,  the  first  eight  moments  must  be  finite.  This 
restriction  requires  that  the  class  of  distribution  in  which  this  condition  is  not  fulfilled 
should  be  set  aside  as  "  heterotypic,"  and  that  the  fourth  moment  should  become 
practically  valueless  as  this  class  is  approached.  It  should  be  made  clear,  however, 
that  there  is  nothing  anomalous  about  these  so-called  "  heterotypic  "  distributions 
except  the  fact  that  the  method  of  moments  cannot  be  applied  to  them.  More- 
over, for  that  class  of  distribution  to  which  the  method  can  be  applied,  it  has  not 
been  shown,  except  in  the  case  of  the  normal  curve,  that  the  best  values  will  be 
obtained  by  the  method  of  moments.  The  method  will,  in  these  cases,  certainly  be 
serviceable  in  yielding  an  approximation,  but  to  discover  whether  this  approximation 
is  a  good  or  a  bad  one,  and  to  improve  it,  if  necessary,  a  more  adequate  criterion  is 
reqiiired. 

A  single  example  will  be  sufficient  to  illustrate  the  practical  difficidty  alluded  to 
above.  If  a  point  P  lie  at  known  (unit)  distance  from  a  straight  Ime  AB,  and  lines  be 
drawn  at  random  through  P,  then  the  distribution  of  the  points  of  intersection  with 
AB  will  be  distributed  so  that  the  frequency  in  any  range  dx  is 

7..      1  dx 

dj  =  -. 


7r'  1+  {x—my 

in  which  x  is  the  distance  of  the  infinitesimal  range  dx  from  a  fixed  point  0  on  the  line, 
and  m  is  the  distance,  from  this  point,  of  the  foot  of  the  perpendicular  PM.     The  distri- 
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bution  will  be  a  symmetrical  one  (Type  VII.)  having  its  centre  at  x  =-  m  (fig.  1).  It  is 
therefore  a  perfectly  definite  problem  to  estimate  the  value  of  m  (to  find  the  best  value  of 
m)  from  a  random  sample  of  values  of  x.  We  have  stated  the  problem  in  its  simplest 
possible  form  :    only  one  parameter  is  required,  the  middle  point  of  the  distribution. 


-6 


-5-4-3-2-1  0  I  2  3  4 

Fig.  1.     Symmetrical  error  curves  of  equal  intrinsic  accuracy. 


df 


1      dx 


TT     \+X^' 


B 


7=   C      4 

2s/7r 


By  the  method  of  moments,  this  should  be  given  by  the  first  moment,  that  is  by  the 
mean  of  the  observations  :  such  would  seem  to  be  at  least  a  good  estimate.  It  is, 
however,  entirely  valueless.  The  distribution  of  the  mean  of  such  samples  is  in  fact  the 
same,  identically,  as  that  of  a  single  observation.  In  taking  the  mean  of  100  values  of 
X,  we  are  no  nearer  obtaining  the  value  of  m  than  if  we  had  chosen  any  value  of  x  out 
of  the  100.  The  problem,  however,  is  not  in  the  least  an  impracticable  one :  clearly 
from  a  large  sample  we  ought  to  be  able  to  estimate  the  centre  of  the  distribution  with 
some  precision  ;  the  mean,  however,  is  an  entirely  useless  statistic  for  the  purpose. 
By  taking  the  median  of  a  large  sample,  a  fair  approximation  is  obtained,  for  the  standard 

error  of  the  median  of  a  large  sample  of  n  is     "^ ,-,  which,  alone,  is  enough  to  show  that 

'^\/  n 

by  adopting  adequate  statistical  methods  it  must  be  possible  to  estimate  the  value  for 
m,  with  increasing  accuracy,  as  the  size  of  the  sample  is  increased. 

This  example  serves  also  to  illustrate  the  practical  difficulty  which  observers  often 
find,  that  a  few  extreme  observations  appear  to  dominate  the  value  of  the  mean.  In 
these  cases  the  rejection  of  extreme  values  is  often  advocated,  and  it  may  often  ha23pen 
that  gross  errors  are  thus  rejected.  As  a  statistical  measure,  however,  the  rejection  of 
observations  is  too  crude  to  be  defended  :  and  unless  there  are  other  reasons  for  rejec- 
tion than  mere  divergence  from  the  majority,  it  would  be  more  philosophical  to  accept 
these  extreme  values,  not  as  gross  errors,  but  as  indications  that  the  distribution  of 
errors  is  not  normal.     As  we  shall  show,  the  only  Pearsonian  curve  for  which  the  mean 
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is  the  best  statistic  for  locating  the  curve,  is  the  normal  or  gaussian  curve  of  errors.  If 
the  curve  is  not  of  this  form  the  mean  is  not  necessarily,  as  we  have  seen,  of  any  value 
whatever.  The  determination  of  the  true  curves  of  variation  for  different  types  of  work 
is  therefore  of  great  practical  importance,  and  this  can  only  be  done  by  different  workers 
recording  their  data  in  full  without  rejections,  however  they  may  please  to  treat  the 
data  so  recorded.  Assuredly  an  observer  need  be  exposed  to  no  criticism,  if  after 
recording  data  which  are  not  probably  normal  in  distribution,  he  prefers  to  adopt  some 
value  other  than  the  arithmetic  mean. 

6.  Formal  Solution  of  Problems  of  Estimation. 

The  form  in  which  the  criterion  of  sufficiency  has  been  presented  is  not  of  direct 
assistance  in  the  solution  of  problems  of  estimation.  For  it  is  necessary  first  to  know 
the  statistic  concerned  and  its  sm-face  of  distribution,  with  an  infinite  number  of  other 
statistics,  before  its  sufficiency  can  be  tested.  For  the  solution  of  problems  of 
estimation  we  require  a  method  which  for  each  particiilar  problem  will  lead  us 
automatically  to  the  statistic  by  which  the  criterion  of  sufficiency  is  satisfied.  Such  a 
method  is,  I  believe,  provided  by  the  Method  of  Maximum  Likelihood,  although  I  am 
not  satisfied  as  to  the  mathematical  rigour  of  any  proof  which  I  can  put  forward  to 
that  effect.  Readers  of  the  ensuing  pages  are  invited  to  form  their  own  opinion  as 
to  the  possibility  of  the  method  of  the  maximum  likelihood  leading  in  any  case  to  an 
insufficient  statistic.  For  my  own  part  I  should  gladly  have  withheld  publication  until 
a  rigorously  complete  proof  could  have  been  formulated  ;  but  the  number  and  variety 
of  the  new  results  which  the  method  discloses  press  for  publication,  and  at  the  same 
time  I  am  not  insensible  of  the  advantage  which  accrues  to  Applied  Mathematics  from 
the  co-operation  of  the  Pure  Mathematician,  and  this  co-operation  is  not  infrequently 
called  forth  by  the  very  imperfections  of  writers  on  Applied  Mathematics. 

If  in  any  distribution  involving  unknown  parameters  0„  6-,,  0-j,  ...  ,  the  chance  of 
an  observation  falling  in  the  range  dx  be  represented  by 

f{x,0,,0,,...)dx, 

then  the  chance  that  in  a  sample  of  n,  m,  fall  in  the  range  dx^,  n.,  in  the  range  dx.^,  and 
so  on,  will  be 


n(n,!) 

The  method  of  maximum  likelihood  consists  simply  in  choosing  that  set  of  values 
for  the  parameters  which  makes  this  quantity  a  maximum,  and  since  in  this  expression 
the  parameters  are  only  involved  in  the  function/,  we  have  to  make' 

S(log/) 
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a  maximum  for  variations  of  0^,  0.,,  0.„  &c.     In  this  form  the  method  is  applicable  to 
the  fitting  of  populations  involving  any  number  of  variates,  and  equally  to  discontinuous  ■ 
as  to  continuous  distributions. 

In  order  to  make  clear  the  distinction  between  this  method  and  that  of  Bates,  we 
will  apply  it  to  the  same  t}'pe  of  problem  as  that  which  Bates  discussed,  in  the  hope 
of  making  clear  exactly  of  what  kind  is  the  information  which  a  sample  is  capable  of 
suppljdng.  This  question  naturally  first  arose,  not  with  respect  to  populations  distri- 
buted in  frequency  curves  and  surfaces,  but  with  respect  to  a  population  regarded  as 
divided  into  two  classes  only,  in  fact  in  problems  of  probability.  A  certain  proportion, 
p,  of  an  infinite  population  is  supposed  to  be  of  a  certain  kind,  e.g.,  "  successes,"  the 
remainder  are  then  "  failures."  A  sample  of  n  is  taken  and  found  to  contain  x  successes 
and  y  failures.     The  chance  of  ol)taining  such  a  sample  is  evidently 

Applying  the  method  of  maximum  likelihood,  we  have 

*^  (log/)  =  x\og  p  +  y  log  (1  -p) 

whence,  differentiating  with  respect  to  2J,  in  order  to  make  this  quantity  a  maximum, 

X         y  A       X 

-  =  --"     oi"     p  =  -. 
p      1  —p  n 

The  question  then  arises  as  to  the  accuracy  of  this  determination.  This  question  was 
first  discussed  by  Bates  (10),  in  a  form  which  we  may  state  thus.  After  observing 
this  sample,  when  we  know  p,  what  is  the  probability  that  p  lies  in  any  range  dp  ?  In 
other  words,  what  is  the  frequency  distribution  of  the  values  of  p  in  populations  which 
are  selected  by  the  restriction  that  a  sample  of  n  taken  from  each  of  them  yields  x 
successes.  Without  further  data,  as  Bates  perceived,  this  problem  is  insoluble.  To 
render  it  capable  of  mathematical  treatment,  Bates  introduced  the  datum,  that  among 
the  populations  upon  wliich  the  experiment  was  tried,  those  in  wdiich  p  lay  in  the  range 
dp  were  equally  frequent  for  all  equal  ranges  dp.  The  probability  that  the  value  of  p 
lay  in  any  range  dp  was  therefore  assumed  to  be  simply  dp,  before  the  sample  was 
taken.     After  the  selection  effected  by  observing  the  sample,  the  probability  is  clearly 

proportional  to 

p'{l-j))"dp. 

After  giving  this  solution,  based  upon  the  particular  datum  stated,  Bates  adds  a 
scholium  the  purj^ort  of  wliicli  would  seem  to  be  that  in  the  absence  of  all  knowledge 
save  that  supplied  by  tlie  sample,  it  is  reasonable  to  assume  this  particular  a  priori 
distribution  of  p.  The  result,  the  datum,  and  the  fostidate  implied  by  the  scholium,  have 
all  been  somewhat  loosely  spoken  of  as  Bates'  Theorem. 
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The  postulate  would,  if  true,  be  of  great  importance  in  bringing  an  immense  variety 
of  questions  within  the  domain  of  probability.  It  is,  however,  evidently  extremely  arbi- 
trary. Apart  from  evolving  a  vitally  important  piece  of  knowledge,  that  of  the  exact 
form  of  the  distribution  of  values  of  p,  out  of  an  assumption  of  complete  ignorance,  it  is 
not  even  a  unique  solution.  Vqv  we  might  never  have  happened  to  direct  our  attention 
to  the  particular  quantity  p  :  we  might  equally  have  measured  probability  upon  an 
entirely  different  scale.     If,  for  instance, 

sin  e  =  2p-l, 

the  quantity,  0,  measures  the  degree  of  probability,  just  as  well  as  p,  and  is  even,  for 
some  purposes,  the  more  suitable  variable.  The  chance  of  obtaining  a  sample  of  x 
successes  and  y  failures  is  now 


n  ■ 


(l+sln0)^(l-sin(9)y; 


2"  a;!?/! 
applying  the  method  of  maximum  likelihood, 

S  (log/)  =  X  log(l+sinf))  +2/log(l— sine)  — /tlog  2, 
and  differentiating  with  respect  to  0, 

—    •  whence         sm  6  =    ^     , 


l  +  sine       l-sin0' 

an  exactly  equivalent  solution  to  that  obtained  using  the  variable  p.  But  what  a  priori 
assumption  are  we  to  make  as  to  the  distribution  oi  6  "i  Axe  we  to  assume  that  6  is 
equally  likely  to  lie  in  all  equal  ranges  dd  ?  In  this  case  the  a  priori  probability  will 
be  ddjir,  and  that  after  making  the  observations  will  be  proportional  to 

(1+sin  0)^(1 -sin  0)5'f?a 

But  if  we  interpret  tins  in  ternxs  of  j),  we  obtain 

Vp{l-2^) 

a  result  inconsistent  mth  that  obtained  previously.  In  fact,  the  distribution  previously 
assumed  for  p  was  equivalent  to  assuming  the  special  distribution  for  6, 

Li 

the  arbitrariness  of  which  is  fully  apparent  when  we  use  any  variable  other  than  p. 
In  a  less  obtrusive  form  the  same  species  of  arbitrary  assumption  underlies  the  method 
VOL.  ccxxii. — -A.  2  z 
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laiovvn  as  that  of  inverse  probability.  Thus,  if  the  same  observed  result  A  might  be 
the  consequence  of  one  or  other  of  two  hypothetical  conditions  X  and  Y,  it  is  assumed 
that  the  probabilities  of  X  and  Y  are  in  the  same  ratio  as  the  probabilities  of  A  occurring 
on  the  two  assumptions,  X  is  true,  Y  is  true.  This  amounts  to  assuming  that  before 
A  was  observed,  it  was  known  that  our  universe  had  been  selected  at  random  for  an 
infinite  population  in  which  X  was  true  in  one  half,  and  Y  true  in  the  other  half. 
Clearly  such  an  assumption  is  entirely  arbitrary,  nor  has  any  method  been  put  forward 
by  which  such  assumptions  can  be  made  even  with  consistent  uniqueness.  There 
is  nothing  to  prevent  an  irrelevant  distinction  being  drawn  among  the  hypothetical 
conditions  represented  by  X,  so  that  we  have  to  consider  two  hypothetical  possibilities 
Xi  and  X,„  on  both  of  which  A  will  occur  with  equal  frequency.  Such  a  distinction 
should  make  no  difference  whatever  to  our  conclusions  ;  but  on  the  principle  of  inverse 
probability  it  does  so,  for  if  previously  the  relative  probabilities  were  reckoned  to  be 
in  the  ratio  x  to  y,  they  must  now  be  reckoned  2x  to  y.  Nor  has  any  criterion  been 
suggested  by  which  it  is  possible  to  separate  such  irrelevant  distinctions  from  those 
which  are  relevant. 

There  would  be  no  need  to  emphasise  the  baseless  character  of  the  assumptions  made 
under  the  titles  of  inverse  probability  and  Bayes'  Theorem  in  view  of  the  decisive 
criticism  to  which  they  have  been  exposed  at  the  hands  of  Boole,  Venn,  and  Chrystal, 
were  it  not  for  the  fact  that  the  older  writers,  such  as  Laplace  and  PoissoN,  who  accepted 
these  assumptions,  also  laid  the  foundations  of  the  modern  theory  of  statistics,  and  have 
introduced  into  their  discussions  of  this  subject  ideas  of  a  similar  character.  I  must 
indeed  plead  guilty  in  my  original  statement  of  the  Method  of  the  Maximum  Likeli- 
hood (9)  to  having  based  my  argument  upon  the  principle  of  inverse  probability  ;  in  the 
same  paper,  it  is  true,  I  emphasised  the  fact  that  such  inverse  probabilities  were  relative 
only.  That  is  to  say,  that  while  we  might  speak  of  one  value  of  f  as  having  an  inverse 
probability  three  times  that  of  another  value  of  f,  we  naight  on  no  account  introduce 
the  differential  element  Af,  so  as  to  be  able  to  say  that  it  was  three  times  as  probable 
that  f  should  lie  in  one  rather  than  the  other  of  two  equal  elements.  Upon  considera- 
tion, therefore,  I  perceive  that  the  word  probability  is  wrongly  used  in  such  a  connection  : 
probability  is  a  ratio  of  frequencies,  and  about  the  frequencies  of  such  values  we  can 
know  nothing  whatever.  We  must  return  to  the  actual  fact  that  one  value  of  f,  of 
the  frequency  of  which  we  know  nothing,  would  yield  the  observed  result  three  times 
as  frequently  as  would  another  value  of  f.  If  we  need  a  word  to  characterise  this 
relative  property  of  different  values  of  ^,  I  suggest  that  we  may  spealv  without  confusion 
of  the  likelihood  of  one  value  of  f  being  thrice  the  likelihood  of  another,  bearing  always 
in  mind  that  likelihood  is  not  here  used  loosely  as  a  synonym  of  probabiUty,  but  simply 
to  express  the  relative  frequencies  with  which  such  values  of  the  hypothetical  quantity 
f  would  in  fact  yield  the  observed  sample. 

The  solution  of  the  problems  of  calculating  from  a  sample  the  parameters  of  the 
hypothetical  population,  which  we  have  put  forward  in  the  method  of  maximum  likeli- 
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liood,  consists,  tlien,  simply  of  choosing  such  values  of  these  parameters  as  have  the 
maximum  likeliliood.  Formally,  therefore,  it  resembles  the  calculation  of  the  mode  of 
an  inverse  frequency  distribution.  This  resemblance  is  quite  superficial  :  if  the  scale 
of  measurement  of  the  hypothetical  quantity  be  altered,  the  mode  must  change  its 
position,  and  can  be  brought  to  have  any  value,  by  an  appropriate  change  of  scale  ;  but 
the  optimum,  as  the  position  of  maximum  likelihood  may  be  called,  is  entirely  unchanged 
by  any  such  transformation.  Likelihood  also  differs  from  probability*  in  that  it  is  not 
a  differential  element,  and  is  incapable  of  being  integrated  :  it  is  assigned  to  a  particular 
point  of  the  range  of  variation,  not  to  a  particular  element  of  it.  There  is  therefore  an 
absolute  measure  of  probability  in  that  the  unit  is  chosen  so  as  to  make  all  the  elementary 
probabilities  add  up  to  unity.  There  is  no  such  absolute  measure  of  likelihood.  It 
may  be  convenient  to  assign  the  value  unity  to  the  maximum  value,  and  to  measure 
other  likelihoods  by  comparison,  but  there  will  then  be  an  iniimte  number  of  values 
whose  likelihood  is  greater  than  one-half.  The  sum  of  the  likelilioods  of  admissible 
values  will  always  be  infinite. 

Our  interpretation  of  Bayes'  problem,  then,  is  that  the  likelihood  of  any  value  of  ^J 
is  proportional  to 

and  is  therefore  a  maximum  when 

X 

p  =  -, 

n 

which  is  the  best  value  obtainable  from  the  sample  ;  we  shall  term  this  the  02')timum 
value  of  jp.  Other  values  of  f  for  which  the  likelihood  is  not  much  less  cannot,  however, 
be  deemed  unlikely  values  for  the  true  value  of  p.  We  do  not,  and  cannot,  know,  from 
the  information  supplied  by  a  sample,  anything  about  the  probability  that  p  should  lie 
between  any  named  values. 

The  reliance  to  be  placed  on  such  a  result  must  depend  upon  the  frequency  distribution 
of  X,  in  different  samples  from  the  same  population.  This  is  a  perfectly  objective 
statistical  problem,  of  the  kind  we  have  called  problems  of  distribution  ;  it  is,  however, 
capable  of  an  approximate  solution,  directly  from  the  mathematical  form  of  the 
likelihood. 

When  for  large  samples  the  distribution  of  any  statistic,  0,,  tends  to  normality,  we 

*  It  should  be  remarked  that  likelihood,  as  above  defined,  is  not  only  fundamentally  distinct  from 
mathematical  probability,  but  also  from  the  logical  "  probability  "  by  which  Mr.  Keynes  (21)  has  recently 
attempted  to  develop  a  method  of  treatment  of  uncertain  inference,  applicable  to  those  cases  where  we 
lack  the  statistical  information  necessary  for  the  application  of  mathematical  probability.  Although,  m 
an  important  class  of  cases,  the  likelihood  may  be  held  to  measure  the  degree  of  our  rational  belief  in  a 
conclusion,  in  the  same  sense  as  Mr.  Keynes'  "  jirobability,"  yet  since  the  latter  quantity  is  constrained, 
somewhat  arbitrarily,  to  obey  the  addition  theorem  of  mathematical  probability,  the  likelihood  is  a 
quantity  which  falls  definitely  outside  its  scope. 

2  z  2 
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may  write  down  the  chance  for  a  given  value  of  the  parameter  e*,  that  6^  should  lie  in 
the  range  dOi  in  the  form 

<7\/27r 

The  mean  value  of  Oi  will  be  the  true  value  6,  and  the  standard  deviation  is  a-,  the 
sample  being  assumed  sufficiently  large  for  us  to  disregard  the  dependence  of  o-  upon  d. 
The  likelihood  of  any  value,  6),  is  proportional  to 


(e,-ev- 

e      ■''■'  , 

this  quantity  having 

its  maximum 

.  value,  unity, 

when 

for 

a;>"g*-V- 

Differentiating  now  a 

second  time 

8T^'»g*=- 

1 

Now  cj)  stands  for  the  total  frequency  of  all  samples  for  which  the  chosen  statistic 
has  the  value  0,,  consequently  (f)  =  S'  (</)),  the  summation  being  taken  over  all  such 
examples,  where  <-/>  stands  for  the  probability  of  occurrence  of  a  certain  specified  sample. 
For  which  we  know  that 

log9S  =  C  +  S(log/), 

the  summation  being  taken  over  the  individual  members  of  the  sample. 
If  now  we  expand  log/  in  the  form 


iog/(^)  =  ic-«/(f*,)+y-e,i^iog/(«j  +  ^^J,iog/(e,)  +  ..., 

log/=  \ogj\  +  ae^,+  ^0^'+..., 


we  have 


log  ,/,  =  C  +  O-OiH  {<c)  +  }  6-e,'  S  (b)  +  ...  ; 

now  for  optimum  statistics 

S  (a)  =^  0, 

and  for  sufficiently  large  samples  S  (h)  differs  from  nh  only  by  a  quantity  of  order  v/~o-(, ; 
moreover,  0-9i  being  of  order  u~-,  the  only  terms  in  log  ,/,  which  are  not  reduced 
without  limit,  as  n  is  increased,  are 


log,/,  =  C  +  :Ln(j0-o; ; 
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hence 

Now  this  factor  is  constant  for  all  samples  which  have  the  same  value  of  6i,  hence 
the  variation  of  fl>  with  respect  to  6  is  represented  by  the  same  factor,  and  conse- 
quently 

logO)  =  C'  +  ^ube-ei'; 
whence 


where 


01  being  the  optimum  value  of  0. 
The  formula 


1  ?'   1       ^  T 


I 


o 


-    —  =    ^^2  log/ 

supplies  the  most  dii-ect  way  known  to  me  of  finding  the  probable  errors  of  statistics. 
It  may  be  seen  that  the  above  proof  applies  only  to  statistics  obtained  by  the  method 
of  maximum  likelihood.* 

For  example,  to  find  the  standard  deviation  of 

X 

P  =  - 

n 

*  A  similar  method  of  obtaining  the  standard  deviations  and  correlations  of  statistics  derived  from 
large  samples  was  developed  by  Pearson  and  Filon  in  1898  (16).  It  is  unfortunate  that  in  this  memoir 
no  sufficient  distinction  is  drawn  between  the  pojjulalion  and  the  sample,  in  consequence  of  which  the 
formulae  obtained  indicate  that  the  likelihood  is  always  a  maximum  (for  continuous  distributions)  when 
tlie  mea7i  of  each  variate  in  the  sample  is  equated  to  the  corresponding  mean  in  the  population  (16,  p.  232, 
'■  A,-  =  0  ").  If  this  were  so  the  mean  would  always  be  a  sufficient  statistic  for  location  ;  but  as  we  have 
already  seen,  and  will  see  later  in  more  detail,  this  is  far  from  being  the  case.  The  same  argument,  indeed, 
is  applied  to  all  statistics,  as  to  which  nothing  but  their  consistency  can  be  truly  affirmed. 

The  probable  errors  obtained  in  this  way  are  those  ajipropriate  to  the  method  of  maximum  likelihood 
but  not  in  other  cases  to  statistics  obtained  by  the  method  of  moments,  by  which  method  the  examples 
given  were  fitted.  In  the  '  Tables  for  Statisticians  and  Biometricians  '  (1914),  the  probable  errors  of  the 
constants  of  the  Pearsonian  curves  arc  those  proper  to  the  method  of  moments  ;  no  mention  is  there  made 
of  this  change  of  practice,  nor  is  the  publication  of  1898  referred  to. 

It  would  appear  that  shortly  before  1898  the  process  which  leads  to  the  correct  value,  of  the  probable 
errors  of  optimum  statistics,  was  hit  upon  and  found  to  agree  with  the  probable  errors  of  statistics  found 
by  the  method  of  moments  for  normal  curves  and  surfaces  ;  without  further  enquiry  it  would  appear  to 
have  been  assumed  that  this  process  was  valid  in  all  cases,  its  directness  and  simi)licity  being  peculiarly 
attractive.  The  mistake  was  at  that  time,  perhaps,  a  natural  one  ;  but  that  it  should  have  been  discovered 
and  corrected  without  revealing  the  inefficiency  of  the  method  of  moments  is  a  very  remarkable  circumstance. 

In  1903  the  correct  formulas  for  the  probable  errors  of  statistics  found  by  the  method  of  moments  are 
given  in  '  Biometrika  '  (19)  ;  references  are  there  given  to  Sheppaed  (20),  whose  method  is  emijloyed,  as 
well  as  to  Pearson  and  Filon  (16),  although  both  the  method  and  the  results  differ  from  those  of  the  latter. 
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in  samples  from  an  infinite  population  of  which  the  true  value  is  p, 

log/=  \ogp+y  log(l-p), 
3  1       f      X        y 


~2  *"& 


l0g/=    --2- 


Now  the  mean  value  of  .c  in  pn,  and  of  //  is  ( 1  -p)  n,  hence  the  mean  value  of 


W^'""-'''  -(l.^m; 


\2J     l—p 
therefore 

2    .  p(l-?j) 

the  well-known  formula  for  the  standard  error  of  p. 

7.  Satisfaction  of  the  Criterion  of  Sufficiency. 

That  the  criterion  of  sufficiency  is  generally  satisfied  by  the  solution  obtained  by 
the  method  of  maximum  likelihood  appears  from  the  following  considerations. 

If  the  mdividual  values  of  any  sample  of  data  are  regarded  as  co-ordinates  in 
hyperspace,  then  any  sample  may  be  represented  by  a  single  point,  and  the  frequency 
distribution  of  an  infinite  number  of  random  samples  is  represented  by  a  density 
distribution  in  hyperspace.  If  any  set  of  statistics  be  chosen  to  be  calculated  from 
the  samples,  certain  regions  will  provide  identical  sets  of  statistics  ;  these  may  be  called 
isostatistical  regions.  For  any  particular  space  element,  corresponding  to  an  actual 
sample,  there  will  be  a  particular  set  of  parameters  for  which  the  frequency  in  that 
element  is  a  maximum  ;  this  will  be  the  optimum  set  of  parameters  for  that  element. 
If  now  the  set  of  statistics  chosen  are  those  which  give  the  optimum  values  of  the 
parameters,  then  all  the  elements  of  any  part  of  the  same  isostatistical  region  will 
contain  the  greatest  possible  frequency  for  the  same  set  of  values  of  the  parameters, 
and  therefore  any  region  which  lies  wholly  within  an  isostatistical  region  will  contain 
its  maximum  frequency  for  that  set  of  values. 

Now  let  0  be  the  value  of  any  parameter,  §  the  statistic  calculated  by  the  method  of 
maximum  likelihood,  and  0i  any  other  statistic  designed  to  estimate  the  value  of  6, 
then  for  a  sample  of  given  size,  we  may  take 

to  represent  the  frequency  with  which  0  and  H,  lie  in  the  assigned  ranges  dO  and  dOi. 
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The  region  dOdO^  evidently  lies   wholly  in  the  isostatistical  region  dO.      Hence   the 
equation 

4  log /(e  J,  00  =  0 

C'd 

is  satisfied,  irrespective  of  0^,  by  the  value  0  —  0.     This  condition  is  satisfied  if 

f{ftJ,O,}  =  ri>{oJ)..j.'{0,  0,); 


for  then 


-     loo-    /■  =     -    loor    d, 


and  the  equation  for  the  optimum  degenerates  into 

^log^(0,  0)  =  O, 

which  does  not  involve  fl,. 

But  the  factorisation  of  /  into  factors  involving  {0,  6)  and  (0,  f?,)  respectively  is  merely 
a  mathematical  expression  of  the  condition  of  sufficiency ;  and  it  appears  that  any 
statistic  which  fulfils  the  condition  of  sufficiency  must  be  a  solution  obtained  by  the 
method  of  the  optimum. 

It  may  be  expected,  therefore,  that  we  shall  be  led  to  a  sufficient  solution  of  problems 
of  estimation  in  general  by  the  following  procedure.  Write  down  the  formula  for  the 
probability  of  an  observation  falling  in  the  range  dx  in  the  form 

f{6,  x)  dx, 

where  6  is  an  unknown  parameter.     Then  if 

L  =  S(log/), 

the  summation  being  extended  over  the  observed  sample,  L  differs  by  a  constant  only 
from  the  logarithm  of  the  likelihood  of  any  value  of  0.  The  most  likely  value,  0,  is 
found  by  the  equation 

d0  ' 

and  the  standard  deviation  of  0,  by  a  second  differentiation,  from  the  formula 

a-'L  _      1  . 

this  latter  formula  being  applicable  only  where  0  is  normally  distributed,  as  is  often 
the  case  with  considerable  accuracy  in  large  samples.  The  value  0-9  so  found  is  in 
these  cases  the  least  possible  value  for  the  standard  deviation  of  a  statistic  designed  to 
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estimate  the  same  parameter  ;   it  may  therefore  be  applied  to  calculate  the  efficiency  of 
any  other  such  statistic. 

When  several  parameters  are  determined  simultaneously,  we  must  equate  the  second 
differentials  of  L,  with  respect  to  the  parameters,  to  the  coefficients  of  the  quadratic 
terms  in  the  uidex  of  the  normal  expression  which  represents  the  distribution  of  the 
corresponding  statistics.     Thus  with  two  parameters, 

?n.  ^  1        _L  gL_  1        _!_ 

a=L        ,       1  r 

+ 


or,  in  effect,  n-/  is  found  by  dividing  the  Hessian  determinant  of  L,  with  respect  to  the 
parameters,  into  the  corresponding  minor. 

The  application  of  these  methods  to  such  a  series  of  parameters  as  occur  in  the  speci- 
fication of  frequency  curves  may  best  be  made  clear  by  an  example. 

8.  The  Efficiency  of  the  Method  op  Moments  in  Fitting  Curves  of  the 

Pearsonian  Type  III. 

Curves  of  Pearson's  Type  III.  offer  a  good  example  for  the  calculation  of  the  efficiency 
of  the  Method  of  Moments.     The  chance  of  an  observation  falling  in  the  range  dx  is 

df  = ^  tx—m\  ^   ~T  dx* 

■^        rt.p!    \    «     / 

By  the  method  of  moments  the  curve  is  located  by  means  of  the  statistic  /*,,  its  dimen- 
sions are  ascertained  from  the  second  moment  fx.,,  and  the  remaining  parameter  f  is 
determined  from  /3i.  Considering  first  the  problem  of  location,  if  a  and  f  were  known 
and  we  had  only  to  determine  m,  we  should  take,  according  to  the  method  of  moments, 

where  m^  represents  the  estimate  of  the  parameter  m,  obtained  by  using  the  method  of 
moments.     The  variance  of  m^  is,  therefore, 

3  3        Ms      a"(p-l-l) 

-r  m^  =  o-  ^,  =  —  -  ^-'- '-. 

If,  on  the  other  hand,  we  aim  at  greater  accuracy,  and  make  the  likelihood  of  the 
sample  a  maximum  for  variations  of  w,  we  have 

x—in\     ctl-c  —  ni 


L  =  -n  log  a-n  log  {p  !)  +pS   log  -^—^   -S 


'^      a    /         \    a 


*  The  expression,  x  1,  is  used  here  and  throughout  as  equivalent  to  the  Gaussian   II  (x),  or  to  V  (x+1), 
whether  x  is  an  integer  or  not. 
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and  the  equation  to  determine  m  is 

?^  =  -ps(-^)  +  ^'=0; (1) 


x—mj      a 


the  accuracy  of  the  value  so  obtained  is  found  from  the  second  differential, 
of  which  the  mean  value  is 


dm  \x-m 


n 


whence 

l^m     — • 

re 
We  now  see  that  the  efficiency  of  location  by  the  method  of  moments  is 

jo+l  p+l' 

Efficiencies  of  over  80  per  cent,  for  location  are  therefore  obtained  if  p  exceeds  9  ;  for 
p  =  I  the  efficiency  of  location  vanishes,  as  in  other  cases  where  the  curve  makes  an 
angle  with  the  axis  at  the  end  of  its  range. 

Turning  now  to  the  problem  of  scaling,  we  have,  by  the  method  of  moments, 

fx,  =  a-{p+l), 
whence,  knowing  p,  a  is  obtained.     Since 

2  /32-1        2 

<^  >.,  — ^i■J , 

n 


/Wl^^2^4_+Ml,3=      p  +  4       ^. 


we  must  have 

""         4re    "  8n     '"        2,{p+\)n 

on  the  other  hand,  from  the  value  of  L,  we  find  the  equation 

i  =  ->+'>^i«(■'-»)  =  » w 

to  be  solved  for  m  and  a  as  a  simultaneous  equation  with  (1)  ;   whence 

3m  da  a' ' 

and 
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of  which  the  mean  value  is 

7l{p+l) 

a' 

The  variance  of  a,  determined  from  this  pair  of  simultaneous  equations,  is  found  by 
dividing 


n 


by  the  determinant 


which  reduces  to 
whence 


dm' 

a'^ip-l) 

n 

n 

a'ip-l) 

11 

n{p+l) 

2 
P- 

n' 

,.2 

a 


and  the  efficiency  of  scaling  by  the  method  of  moments  is 

p  +  4  /J  +  4 

Efficiency  of  over  80  per  cent,  for  scaling  are,  therefore,  obtained  when  p  exceeds  11. 
The  efficiency  of  scaling  does  not,  however,  vanish  for  any  possible  value  of  ]},  though 
it  tends  to  zero,  as  j; 'approaches  its  limiting  value,  —1. 

Lastly,  f  is  found  by  the  method  of  moments  by  putting 


Now 


p+l 


A 


V  =  -(4A- 24,^3+ 3G  +  9/:^i/:(,-l-/33+35A), 
and  for  curves  of  Type  III, 

i«.  =  3  +  i5/3„ 

A, -2/3,/5,  +  4/5i,  =  /3,(3A+10), 

fi,  =  I  (A.  +  2/i,)  =  §  (3/3i^+13/3,  +  6), 

3A/„ 


hence 


^,-  =  ~—{^^f-ii  +  4.){(3,  +  4), 


^  §1    r,(;j  +  2)(p  +  6)  ^ 
?i  p+ 1 
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whence  it  follows,  since  n  is  large,  that 
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From  the  value  of  L, 


aL 


d 


which  equation  solved  for  m,  a  and  p  as  a  simultaneous  equation  with  (1)  and  (2),  will 
yield  the  set  of  values  for  the  parameters  which  has  the  maximum  lilcelihood.  To  find 
the  variance  of  the  value  of  p,  so  obtained,  observe  that 


c'L 


of  which  the  mean  value  is 


and 


dm  ^2) 

n 
ap 

mj  • 

a=^L 

oa  dp 

It 

dp'  ~ 

d' 

dp' 

log 

(p!) 

The  variance  of  jj,  derived  from  this  set  of  simultaneous  equations,  is  therefore  found 

bv  dividing  the  minor  of  ^^— ;-,  namely 

dp" 


by  the  determinant 

1 


^  —  1    a 


I 


p-[ 

1         p+1 


1 
P 
1 


-  1       3^1og(p!) 

p  dp'     ^  ^^    ' 


— , ■  2  ^-^,  log  (« !) h  —  ■ 

a'   p-l  I    dp'     ^^-^  '     p     p-J 


hence 


«^2|^log(p!)-^  +  A 


When  p  is  large. 


dp' 


d.n'     "^  ^^    ^     p^  2)'       ^\   •'      5p"     72^      J 


3  A  2 
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so  that,  approximately, 

'        n 
for  large  values  of  ]),  the  efficiency  of  the  method  of  moments  is,  therefore,  approximately 

__p[+lp        _ 

Efficiencies  of  over  80  per  cent,  occur  when  p  exceeds  38-1  (/5j  =  0-102)  ;  evidently 
the  method  of  moments  is  effective  for  determining  the  form  of  the  curve  only  when  it 
is  relatively  close  to  the  normal  form.  For  small  values  of  p,  the  above  approximation 
for  the  efficiency  is  not  adequate.  The  true  values  can  easily  be  obtained  from  the 
recently  published  tables  of  the  Trigamma*  function  (11).  The  following  values  are 
obtained  for  the  integral  values  of  p  from  0  to  5. 


p 

0 

1               2 

3               4               5 

Efficiency      .     . 

0 

0-0274     0-0871 

0-1532     0-2159     0-27 

An  interesting  point  which  may  be  resolved  at  this  stage  of  the  enquiry  is  to  find 
the  variance  of  m,  when  a  and  p  are  not  known,  derived  from  the  above  set  of  simul- 
taneous equations  ;  that  is  to  say,  to  calculate  the  accuracy  with  which  the  limiting 
point  of  the  curve  is  determined  ;  such  determinations  are  often  stated  as  the  result 
of  fitting  curves  of  limited  range,  but  their  probable  errors  are  seldom,  if  ever,  evaluated. 
To  obtain  the  greatest  possible  accuracy  with  which  such  a  point  can  be  determined 

we  must  divide  the  minor  of  '- — r,,  namely, 

dm' 


by 

a*   p—l  [" dp' '"^  ^^"'     p  '  JO 


n' 


rAl^i?-alog(p!)-^  +  i 


whence 


2 


lE 


"P      "  '■'    '     p     p- 


The  position  of  the  limiting  point  will,  when  j)  is  at  all  large,  evidently  be  determined 
with  much  less  accuracy  than  is  the  position,  as  a  whole,  of  a  curve  of  known  form  and 
size.     Let  n'  be  a  multiplier  such  that  the  position  of  the  extremity  of  a  curve  calculated 

*  It  is  sometimes  convenient  to  write  F  (x)  for  -—  log  (x\). 

dx-     ° 
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from  nn'  observations  will  be  determined  with  the  same  accuracy  as  the  position,  as  a 
whole,  of  a  curve  of  known  form  and  size,  can  be  determined  from  a  sample  of  n  observa- 
tions when  n  is  large.     Then 


u'  = 


but,  when  p  is  large, 


p+1^,  log  (;>!)-! 


and 


therefore 


^        dp-    .  2p  \        3p      ip- 

-^dp'^^^    '      p^p'       'SjA       5p'---'' 


,'   _    3„.2  /    >  ^  .  ^ 


7b'  =   fp        1  -    —  />  + 


2)p         1  bp" 

For  large  values  of  p  the  probable  error  of  the  determination  of  the  end-point  may  be 
found  approximately  by  multiplying  the  probable  error  of  location  by 

As  ]}  grows  smaller,  n'  diminishes  until  it  reaches  unity,  when  p  ^  1.  For  values  of 
p  less  than  1  it  would  appear  that  the  end-point  had  a  smaller  probable  error  than  the 
probable  error  of  location,  but,  as  a  matter  of  fact,  for  these  values  location  is  determined 
by  the  end-point,  and  as  we  see  from  the  vanishing  of  a-,;.,  whether  or  not  p  and  a 
are  known,  when^  —  1,  the  weight  of  the  determination  from  this  point  onwards  increases 
more  rapidly  than  n,  as  the  sample  increases.     (See  Section  10.) 

The  above  method  illustrates  how  it  is  possible  to  calculate  the  variance  of  any 
function  of  the  population  parameters  as  estimated  from  large  samples  ;  by  comparing 
this  variance  with  that  of  the  same  function  estimated  by  the  method  of  moments,  we 
may  find  the  efficiency  of  that  method  for  any  proposed  function.  The  above  examina- 
tion, in  which  the  determinations  of  the  locus,  the  scale,  and  the  form  of  the  curve  are 
treated  separately,  will  serve  as  a  general  criterion  of  the  application  of  the  method  of 
moments  to  curves  of  Type  III.  Special  combinations  of  the  parameters  will,  however, 
be  of  interest  in  special  cases.  It  may  be  noted  here  that  by  virtue  of  equation  (2)  the 
function  of  m  -]-a{p  -\-  I)  is  the  same,  whether  determined  by  moments  or  by  the  method 
of  the  optimum : 

"V  +  ^'>  (p,x  +  1 )  =  m  +  d{p  +  l). 

The  efficiency  of  the  method  of  moments  in  determining  this  function  is  therefore  100 
per  cent.  That  this  function  is  the  abscissa  of  the  mean  does  not  imply  100  per  cent, 
efficiency  of  location,  for  the  centre  of  location  of  these  curves  is  not  the  mean  (see  p.  340). 
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9.  Location  and  Scaling  of  Frequency  Curves  in  General. 

The  general  problem  of  the  location  and  scaling  of  curves  may  now  be  treated  more 
generally.  This  is  the  problem  which  presents  itself  with  respect  to  error  curves  of 
assumed  form,  when  to  find  the  best  value  of  the  quantity  measured  we  must  locate  the 
curve  as  accurately  as  possible,  and  to  find  the  probable  error  of  the  result  of  this  process 
we  must,  as  accurately  as  possible,  estimate  its  scale. 

The  form  of  the  curve  may  be  specified  by  a  function  </,,  such  that 

dfoz  e*®  di,  when    c  =  ■:!—l!l. 

In  this  expression  <j,  specifies  the  form  of  the  curve,  which  is  unaltered  by  variations 
of  a  and  m. 

When  a  sample  of  n.  observations  has  been  taken,  the  likelihood  of  any  combination 
of  values  of  a  and  m  is 

L  =  C  —  ll  log  rt  +  S  ((/)), 

whence 


dm  \d^    dml  a 

since 


^^=      '■ 

dm          a 

also 

da          a     ^'^       a 

since 

3a          a 

Differentiating  a 

second  time, 

l>t^'^^"^-' 

therefore 

This  expression  enables  us  to  compare  the  accuracy  of  error  curves  of  different  fonn, 
when  the  location  is  performed  in  each  case  by  the  method  which  yields  the  minimum 
error. 

Example  : — The  curve 

referred  to  in  Section  5  has  an  infinite  standard  deviation,  but  it  is  not  on  that  account 
an  error  curve  of  zero  accuracy,  for 
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Now 


hence 

The  quantity 


(j>"  —  —h         and         a- J  =  — 

n 


a'       'Id' 


which  is  the  factor  by  which  n  is  multiplied  in  calculating  the  weight  of  the  estimate 
made  from  n  measurements,  may  be  called  the  intrinsic  accuracy  of  an  error  curve.  In 
the  above  example  we  see  that  errors  distributed  so  that 

7  J-      a     dx 
dj  =  - 


TT  a"  +  X" 
have  the  same  intrinsic  accuracy  as  errors  distributed  according  to  the  normal  curve 

df  =  —7=  e"^  dx, 

rr  \'  Lit 

provided 

Fig.  1  illustrates  two  such  curves  of  equal  intrinsic  accuracy. 
Returning  now  to  the  general  problem  in  which 

L  =  C  —  n  log  «  +  S  (./)), 
we  have 

dr]xoa      a-  a 

and 

The  latter  expression  will  directly  give  the  accuracy  with  which  a  is  determined  only  if 

8m  9a 
and  we  can  always  arrange  that  this  shall  be  so  by  subtracting  from  ^-  the  quantity 

El 

Thus  in  a  Type  III.  curve  where,  referred  to  the  end  of  the  range. 


p-i 
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instead  of 


we  must  write 
then 

hence 


</*  =p'^og  i-i 


<p=p  loo-  e+p_i  _  e+^_i  ; 
i  +  'p-^  i+p-\ 

da        a 


a'     \    ^       ^+p-l      i+p-\         J 


of  wliich  the  mean  vahie  is 


-.(-^  +  2^-1-^-1-1)=--,, 


hence 


For  one  particular  point   of    origin,   therefore,   the  variations  of  the  abscissa  are 
uucorrelated  with  those  of  a  ;   this  point  may  be  termed  the  centre  of  location. 
Example  : — To  determine  the  centre  of  location  of  the  curve  of  Type  IV., 


fZ/oce-"--'^(l+f)- 


Here 


r  +  2 


,/>  =  -.tan->^-^log  l+f, 


<l'  =  -{"  +  r  +  -2i)l  +  t  , 


.j,"  =r  +  2l  +  f    +2  [ri-r  4-  2)  I  +  f "    ; 


from  these  we  find 


r+lr  +  2r  +  4: 


r  +  i    +j'^ 


777,       r+lr  +  -2. 


so  that 


r  +  4    + 


£^__ 


^"  r  +  4 

The  centre  of  location,  therefore,  at  tlie  distance  from  the  mode, 
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Exmnple  : — Determine  the  intrinsic  accuracy  of  an  error  curve  of  Type  IV.  and  the 
efficiency  of  the  method  of  moments  in  location  and  scaling. 
Since 

-77  r+  lr  +  2r  +  4: 

^     = =f. 7—, 

r  +  4   +v- 


a'        r  +  4   +1 


n    r+\r  +  -lr  +  4 
and  the  intrinsic  accuracy  of  the  curve  is 


I  7-+lr  +  2  r  +  4, 

but 

,  a'     r'^  +  v^ 


n    r-r-l 
therefore  the  elFiciency  of  the  metliod  of  moments  in  location  is 


7-^7--L(r  +  4'  +  i.^) 


(3) 


When  I'  =  0,  we  have  for  curves  of  Type  VII.  an  efficiency  of  location 

r+lr+2 

The  efficiency  of  location  of  these  curves  vanishes  at  r  =  1,  at  which  value  the  standard 
deviation  becomes  infinite.  Although  values  down  to  -1  give  admissible  frequency 
curves,  the  conventional  limit  at  which  curves  are  reckoned  as  heterotypic  is  at  r  —  7. 
For  this  value  the  efficiency  is 

49     121 +1'^ 
132'  49 +  K  ' 

which  varies  from  91-67  per  cent,  for  the  symmetrical  Type  VII.  curve,  to  37-12  per 
cent,  when  i-  ^  00  and  the  curve  to  Type  V. 
Turning  to  the  c^uestion  of  scaling,  we  find 


r  +  4'  +v- 
whence 


^"  r  +  4 

and 

2       a-     ■r  +  4   . 


It     2r  + 
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the  intrinsic  accuracy  of  scaling  is  therefore  independent  of  v.     Now  for  these  curves 


Pa  = r  +  6— ■ 


r-2r-3\  r-  +  v', 

so  that 

^.,-\  __  r'7^2  +ir(r-+I0?--12) 

and  


The  efficiency  of  the  method  of  moments  for  scaling  is  thus 


-2r-3  /•  +  4(rMV) 


r+1  {7-'r-2  +i'^(r-+10r-12)} 
when  »<  =  0,  we  have  for  curves  of  Type  VII.  an  efficiency  of  scaling 

12 


(4) 


1- 


rr+1 


The  efficiency  of  the  method  of  moments  in  scaling  these  curves  vanishes  at  r  =  3, 
where  ^^  becomes  infinite  ;   for  r  =  7,  the  efficiency  of  scaling  is 

55         49  +  1'' 


2     \7lb  +  \07v^ 


varying  in  value  from  78-57  per  cent,  for  the  symmetrical  Type  VII.  curve,  to  25-70 
per  cent,  when  ^  ^  co  and  the  curve  to  Type  V. 

10.  The  Efficiency  op  the  Method  of  Moments  in  fitting  the  Pearsonian 

Curves. 

The  Pearsonian  group  of  skew  curves  are  obtained  as  solutions  of  the  equation 

Idy  ^    -{x-m)  .  ...      (5) 

y  dx       (I  +  hx  +  ex- ' 

algebraically  these  fall  into  two  main  classes, 
and 

(// cc  (  1 -f  iL.  )       -    f,    "'■'"    a  dx, 

\       a'  I 
according  as  the  roots  of  the  quadratic  expression  in  (5)  are  real  or  imaginary. 


FOUNDATIONS  OF   THEORETICAL   STATISTICS.  343 

The  first  of  these  forms  may  be  rewritten 

/  -A  _  L+r 

djccil — jj        e  '^dx, 

r  being  negative,  showing  its  affinity  with  the  second  class. 

In  order  that  these  expressions  may  represent  frequency  curves,  it  is  necessary  that 
the  integral  over  the  whole  range  of  the  curve  should  be  finite  ;  this  restriction  acts  in 
two  ways : — 

(1)  When  the  curve  terminates  at  a  finite  value  of  x,  say  x  =  a^,  the  power  to  which 

a.  —  x  is  raised  must  be  greater  than  —  1 . 

(2)  When  the  curve  extends  to  infinity,  the  ordinate,  when  x  is  large,  must  diminish 

more  rapidly  than  — ; 
In  Fig.  2  is  shown  a  conspectus  of  all  possible  frequency  curves  of  the  Pearsonian  type  ; 

A 


He^erot■yplc  Limi|-f-=7 
Llmi^of(i  diagram  r=  3 


G  s  G' 

A.  Showing  region  of  validity  of  first  moment. 


Heferotypic  Limlf  r=7 
Limifofpdiagram  r:3 


G  S  G' 

B.  Showing  region  of  validity  of  second  moment. 
Fig.  2.     Conspectus  of  Pearsonian  system  of  frequency  curves. 

3   B   2 
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the  lines  AC  and  AC  represent  the  limits  along  which  the  area  between  the  curve  and 
a  vertical  ordinate  tends  to  infinity,  and  on  which  TOj,  or  m.,  takes  the  value  —  1  ;  the 
line  CC  represents  the  limit  at  which  unbounded  curves  enclose  an  infinite  area  with 
the  horizontal  axis  ;  at  this  limit  r  =  —  1. 
The  symmetrical  curves  of  Type  II. 

extend  from  the  point  N,  representing  the  normal  curve,  at  which  r  is  infinite,  through 
the  point  P  at  which  r  =  — 4,  and  the  curve  is  a  farabola,  to  the  point  B  (r  =  —2), 
where  the  curve  takes  the  form  of  a  rectangle  :  from  this  point  the  curves  are  U-shaped, 
and  at  A,  when  the  arms  of  U  are  hyperbolic,  we  have  the  limiting  curve  of  this  type, 
which  is  the  discontinuous  distribution  of  equal  or  unequal  dichotomy  [r  =  0). 

The  unsymmetrical  curves  of  Type  I.  are  divided  by  Pearson  into  three  classes 
according  as  the  terminal  ordinate  is  infinite  at  neither  end,  at  one  end  (J  curves),  or 
at  both  ends  (U  curves)  ;  the  dividing  lines  are  C'BD  and  CBD',  along  which  one  of 
the  terminal  ordinates  are  finite  {nii,  or  tn.j,  =  0)  ;  at  the  point  B,  as  we  have  seen,  both 
terminal  ordinates  are  finite. 

The  same  line  of  division  divides  the  curves  of  Type  III., 

df  (X  x^'e'''  dx, 

at  the  point  E  (^9  =  0),  representing  a  simple  exponential  curve  ;  the  J  curves  of  Type  III. 
extend  to  F  (p  ^  — 1),  at  which  point  the  integral  ceases  to  converge.  In  curves  of 
Type  III.,  r  is  infinite  ;  i-  is  also  infinite,  but  one  of  the  quantities  m,  and  m.,  is  finite, 
or  zero  (=  p)  ;   as  ])  tends  to  infinity  we  approach  the  normal  curve 

df  CC  e"-'''  dx. 

Type  VI.,  like  Type  III.,  consists  of  curves  bounded  only  at  one  end  ;  here  r  is 
positive,  and  both  m,  and  m,  are  finite  or  zero.  For  the  J  curves  of  Type  VI.  both 
w,  and  m,  are  negative,  but  for  the  remainder  of  these  curves  they  are  of  opposite  sign, 
the  negative  index  being  the  greater  by  at  least  unity  in  order  that  the  representative 
point  may  fall  above  CC  (r  =  —1). 

Type  V.  is  here  represented  by  a  parabola  separating  the  regions  of  Types  IV.  and  VI.  ; 
the  typical  equation  of  this  type  of  curve  is 


)■+;! 


df  a:  X      2  (J    X  dx. 

As  r  tends  to  infinity  the  curve  tends  to  the  normal  form  ;  the  integral  does  not 
become  divergent  until  ?-i—  =  1,  or  /  =  —  1.  On  curves  of  Type  V.,  then,  r  is  finite 
or  zero,  but  v  is  infinite. 
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In  Type  IV. 


2  \  -  r±*  J 

a'  ' 


we  have  written  r,  not  as  previously  for  the  dift'erence  between  wi,  and  m,.  for  these 
quantities  are  now  complex,  and  their  difference  is  a  pure  imaginary,  but  for  the  differ- 
ence divided  by  \'  — 1  ;  i>  is  then  real  and  finite  throughout  Type  IV.,  and  it  vanishes 
along  the  line  NS,  representing  the  symmetrical  curves  of  Type  VII. 


?•+- 


from  ?•  =  00  to  r  =  —  1 . 

The  Pearsonian  system  of  frequency  curves  has  hitherto  been  represented  by  the 
diagram  (13,  p.  66),  in  which  the  co-ordinates  are  [i^  and  ,8..  This  is  an  unsymmetrical 
diagram  which,  since  /3,  is  necessarily  positive,  places  the  symmetrical  curves  on  a 
boundary,  whereas  they  are  the  central  types  from  which  the  unsymmetrical  curves 
diverge  on  either  hand  ;  further,  neither  of  the  limiting  conditions  of  these  curves  can 
be  shown  on  the  ft  diagram  ;  the  luuit  of  the  U  curves  is  left  obscure,*  and  the  other 
limits  are  either  projected  to  infinity,  or,  what  is  still  more  troublesome,  the  line  at 
infinity  cuts  across  the  diagram,  as  occurs  along  the  line  r  =  3,  for  there  ft^  becomes 
infinite.  This  diagram  thus  excludes  all  curves  of  Types  VII.,  IV.,  V.,  and  VI.,  for  which 
r<3. 

In  the  ;8  diagram  the  condition  r  =  constant  yields  a  system  of  concurrent  straight 
lines.     The  basis  of  the  representation  in  fig.  2  lies  in  making  these  lines  parallel  and 

horizontal,  so  that  the  ordinate  is  a  function  of  r  only.     We  have  chosen  r  =  y—  - , 

y 

and  have  represented  the  limiting  types  by  the  simplest  geometrical  forms,  straight  lines 
and  parabolas,  by  taking 

A        ,.L'^  2      (1 +?/  +  «■-)  (l-y-.x-) 

It  might  have  been  thought  that  use  could  have  been  made  of  the  criterion, 


'       4(4/3,-3A)(2^,-3A-6) 

by  which  Pearson  distinguishes  these  curves  ;  but  this  criterion  is  only  valid  in  the 
region  treated  by  Pearson.  For  when  r  =  0,  k..  =  1,  and  we  should  have  to  place 
a  variety  of  curves  of  Types  VII.,  IV.,  V.,  and  VI.,  all  in  Type  V.  in  order  to  adhere  to 
the  criterion. 

This  diagram  gives,  I  believe,  the  simplest  possible  conspectus  of  the  whole  of  t!ie 
Pearsonian  system  of  curves  ;   the  inclusion  of  the  curves  beyond  r  =  3  becomes  neces- 

*  The  true  limit  is  the  line  /?;  =  /ii  +  1,  along  which  the  curves  degenerate  into  siiajjle  dioiiotomies. 
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sary  as  soon  as  we  take  a  view  unrestricted  by  the  method  of  moments  ;  of  the  so-called 
heterotypic  curves  between  r  =  3  and  r  =  7  it  should  be  noticed  that  they  not 
only  fall  into  the  ordinary  Pearsonian  types,  but  have  finite  values  for  the  moment 
coefficients  /3,  and  (i. ;  they  differ  from  those  in  which  r  exceeds  7,  merely  in  the  fact 
that  the  value  of  /32,  calculated  from  the  fourth  moment  of  a  sample,  has  an  infinite  probable 
error.  It  is  therefore  evident  that  this  is  not  the  right  method  to  treat  the  sample,  but 
this  does  not  constitute,  as  it  has  been  called,  "  the  failure  of  Type  IV.,"  but  merely 
the  failure  of  the  method  of  moments  to  make  a  valid  estimate  of  the  form  of  these 
curves.  As  we  shall  see  in  more  detail,  the  method  of  moments,  when  its  efficiency  is 
tested,  fails  equally  in  other  parts  of  the  diagram. 

In  expression  (3)  we  have  found  that  the  efficiency  of  the  method  of  moments  for 
location  of  a  curve  of  Type  IV.  is 


E 


•-7--1  {r  +  4:'  +  ,r) 


•+1  ?-  +  2r  +  4(r'  +  r') 


whence  if  we  substitute  for  r  and  v  in  terms  of  the  co-ordinates  of  our  diagram,  we  obtain 
a  general  formula  for  the  efficiency  of  the  method  of  moments  in  locating  Pearsonian 
curves,  which  is  applicable  within  the  boundary  of  the  zero  contour  (fig.  3).     This  may 


r  =  \ 


Fig.  3.     Region  of  validity  of  the  first  moment  (tlie  mean)  applied  in  the  location  of 
Pearsonian  curves  showing  contours  of  efliciencv. 


be  called  the  region  of  validity  of  the  first  moment ;  it  is  bounded  at  the  base  by  the 
line  r  =  1,  so  that  the  first  moment  is  valid  far  beyond  the  heterotypic  limit ;  its  other 
boundary,  however,  represents  those  curves  which  make  a  finite  angle  with  the  axis  at 
the  end  of  their  range  {iii„  or  to.,,  =  1)  ;  all  J  curves  (to,,  or  m..,  <  0)  are  thus  excluded. 
This  boundary  has  a  double  point  at  P,  which  thus  forms  the  apex  of  the  region  of  validity. 
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In  fig.  3  are  shown  the  contours  along  which  the  efficiency  is  20,  40,  60,  and  80  per  cent. 
For  high  efficiencies  these  contours  tend  to  the  system  of  ellipses, 

8a;'  +  6?y'  =  1-E. 

In  a  similar  manner,  we  have  obtained  in  expression  (4)  the  efficiency  of  the 
second  moment  in  fitting  Pearsonian  curves.  The  region  of  validity  in  this  case  is 
shown  in  fig.  4  ;  this  region  is  bounded  by  the  lines  r  =  3,  r  =  —4,  and  by  the  limits 


Fig.  4.     Region  of  validity  of  the  second  moment  (standard  deviation)  applied  in  scaling  of 
Pearsonian  curves,  showing  contours  of  efficiency. 

(m,,  or  m.,,  =  —  1)  on  which  r^-\-v^  vanishes.  This  statistic  is  therefore  valid  for  certain 
J  curves,  though  the  maximum  efficiency  among  the  J  curves  is  about  30  per  cent. 
As  before,  the  contours  are  centred  about  the  normal  curve  (N)  and  for  high  efficiencies 
tend  to  the  system  of  concentric  circles, 

12x'+12y- =  1-E, 

showing  that  the  region  of  high  efficiency  is  somewhat  more  restricted  for  the  second 
moment,  as  compared  to  the  first. 

The  lower  boundary  to  the  efficiencies  of  these  statistics  is  due  merely  to  their  probable 
errors  becoming  infinite,  a  weakness  of  the  method  of  moments  which  has  been  partially 
recognised  by  the  exclusion  of  the  so-called  heterotypic  curves  (/  <  7).  The  stringency 
of  the  upper  boundary  is  much  more  unexpected  ;  the  probable  errors  of  the  moments  do 
not  here  become  infinite  ;  only  the  ratio  of  the  probable  errors  of  the  moments  to  the 
probable  error  of  the  corresponding  optimum  statistics  is  great  and  tends  to  iiffinity  as 
the  size  of  the  sample  is  increased. 

That  this  failure  as  regards  location  occurs  when  the  curve  makes  a  finite  angle  witli 
the  axis  may  be  seen  by  considering  the  occurrence  of  observations  near  the  terminus 
of  the  curve. 

Let 

df  =  kx"  dx 
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in  the  neighbourhood  of  the  terminus,  then  the  chance  of  an  observation  falling  within 
a  distance  x  of  the  terminus  is 

and  the  chance  of  n  observations  all  failing  to  fall  in  this  region  is 

(1-/)" 
or,  when  n  is  great,  and  /  correspondingly  small, 

Equating  this  to  any  finite  probability,  e~",  we  have 

n 

or,  in  other  words,  if  we  use  the  extreme  observation  as  a  means  of  locating  the  terminus, 

the  error,  x,  is  proportional  to 

i_ 

n  "+' ; 

t 

when  a  <  1,  this  quantity  diminishes  more  rapidly  than  vi"',  and  consequently  for  large 
samples  it  is  much  more  accurate  to  locate  the  curve  by  the  extreme  observation  than 
by  the  mean. 

Since  it  might  be  doubted  whether  such  a  simple  method  could  really  be  more  accurate 
than  the  process  of  finding  the  actual  mean,  we  will  take  as  example  the  location  of 
the  curve  (B)  in  the  form  of  a  rectangle, 

,,.      dx                  <*>   ^       ^         fi- 
at =  —  >         m <  X  <  7/n —  > 

a  2  2 

and 

f?/=0, 
outside  these  limits. 

This  is  one  of  the  simplest  types  of  distribution,  and  we  may  readily  obtain  examples 

of  it  from  mathematical  tables.     Tlie  mean  of  the  distribution  is  w,  and  the  standard 

deviation  ^ — ,  the  error  »/ — to,  of  the  mean  obtained  from  n  observations,  when  n  is 
reasonably  large,  is  therefore  distributed  according  to  the  formula 

\  J  ^j^  ,:^-^  ax. 

a     ^      IT 


The  difference  of  the  extreme  observation  from  the  end  of  the  range  is  distributed 


according  to  the  fonuula 


11    -'-^ 


a 
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if  ^  is  the  dift'erence  at  one  end  of  the  range  and  ,j  the  difference  at  the  other  end,  the 
joint  distribution  (since,  when  n  is  considerable,  these  two  quantities  may  be  regarded 
as  independent)  is 


^^2^"      (kd,,. 


Now  if  we  take  the  mean  of  the  extreme  observations  of  the  sample,  our  error  is 


for  which  we  write  x  ;  writing  also  y  for  f  +  '/,  we  have  the  joint  distribution  of  x  and  y, 

,v,2      _fl 

-^,  e  «    dx  dy. 

For  a  given  value  of  x  the  values  of  y  range  from  2 1  x  j  to  oo ,  whence,  integrating  with 
respect  to  y,  we  find  the  distribution  of  x  to  be 

at  =  —  e   "■       dx, 

a 

the  double  exponential  curve  shown  in  fig.  5. 
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Fig.  .5.     Double  exponential  frequency  curve,  showing  distribution  of  23  deviations. 

The  two  error  curves  are  tlius  of  a  radically  different  form,  and  strictly  no  value  for 
the  efficiency  can  be  calculated  ;  if,  however,  we  consider  the  ratio  of  the  two  standard 
deviations,  then 

2  '2,  •         'i  n  m 

<Tfj,    _   (%_ .     a     _  6 

o-',,,    ~  2;*'   ■    12»  ~  n 

when  n  is  large,  a  quantity  which  diminishes  indefinitely  as  the  sample  is  increased. 
VOL.  ccxxii. — A.  3  c 
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For  exami:)le,  we  have  taken  from  Vega  (14)  sets  of  digits  from  the  table  of  Natural 
Logarithms  to  48  places  of  decimals.  The  last  blocl\:  of  four  digits  was  taken  from  the 
logarithms  of  100  consecutive  numbers  from  101  to  200,  giving  a  sample  of  100  numbers 
distributed  evenlv  over  a  limited  range.  It  is  sufficient  to  take  the  three  first  digits 
to  the  nearest  integer  ;  then  each  number  has  an  equal  chance  of  all  values  between  0 
and  1000.  The  true  mean  of  the  population  is  500,  and  the  standard  deviation  289. 
The  standard  error  of  the  mean  of  a  sample  of  100  is  therefore  28-9. 

Twenty-five  such  samples  were  taken,  using  the  last  five  blocks  of  digits,  for  the 
logarithms  of  numbers  from  101  to  600,  and  the  mean  determined  merely  from  the  highest 
and  lowest  number  occurring,  the  following  values  were  obtained  : — 


Digits. 

1st  hundred. 

2nd  hundred. 

3rd  hundred. 

4th  hundred. 

5th  hundred. 

Lowest. 
Highest. 

O 
h-1 

Highest. 

S 
o 

g 
<2 

<v 
■$■ 
o 

Highest. 

m—m. 

4^ 

O 
1-? 

bX) 

a 

1 

45-48 

24      978  +  1-0 

39 

980  +  9-5 

1 

999 

0 

16 

983  -  0-5 

18 

994 

-(-6-0 

41-44 

.35-5  993  -hl4-0 

3 

960  -18-5 

6 

997 

+1-5 

1 

978  -10-5 

4 

979 

-8-5 

37^0 

9      988  -  1-5 

11 

999  4-  5-0 

31 

984 

+7-5 

4 

978  -  9-0 

2 

986 

-6.0 

33-36 

7      995  +  1-0 

13 

997  +  5-0 

4 

998 

+  1-0 

0 

994  -  3-0 

3 

981 

-8-0 

29-32 

1       988  -  5-5 

3 

988  -  4-5 

4 

992 

-2-0 

1 

996  -  1-5 

21 

977 

-1-0 

It  wUl  be  seen  that  these  errors  rarely  exceed  one-half  of  the  standard  error  of  the 
mean  of  the  sample.  The  actual  mean  square  error  of  these  25  values  is  6  •  86,  while  the 
calculated  value,  -^^50,  is  7-07.  It  will  therefore  be  seen  that,  with  samples  of  only  100, 
there  is  no  exaggeration  in  placing  the  efficiency  of  the  method  of  moments  as  low  as 
6  per  cent,  in  comparison  with  the  more  accurate  method,  which  in  this  case  happens 
to  be  far  less  laborious. 

Such  a  value  for  the  efficiency  of  the  mean  in  this  case  is,  however,  purely  conven- 
tional, since  the  curve  of  distribution  is  outside  the  region  of  its  valid  application,  and 
the  two  curves  of  sampling  do  not  tend  to  assume  the  same  form.  It  is,  however, 
convenient  to  have  an  estimate  of  the  effectiveness  of  statistics  for  small  samples,  and 
in  such  cases  we  should  prefer  to  treat  the  curve  of  distribution  of  the  statistic  as  an 
error  curve,  and  to  judge  the  effectiveness  of  the  statistic  by  the  intrinsic  accuracy  of 
tlie  curve  as  defined  in  Section  9.  Thus  the  intrinsic  accuracy  of  the  curve  of  distri- 
bution of  the  mean  of  all  the  observations  is 


a 
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so  yielding  a  ratio  3/?i.     It  is  probable  that  this  quantity  may  prove  a  suitable  substitute 
for  the  efficiency  of  a  statistic  for  curves  beyond  its  region  of  validity. 

To  determine  the  efficiency  of  the  moment  coefficients  /3,  and  ^.,  in  determining  the 
form  of  a  Pearsonian  curve,  we  must  in  general  appl)^  the  method  of  Section  8  to  the 
calculation  of  the  simultaneous  distribution  of  the  four  parameters  of  those  curves  when 
estimated  by  the  method  of  maximum  likelihood.  Expressing  the  curve  by  the  formula 
appropriate  to  Type  IV.,  we  are  led  to  the  determinant 


r^-\  /■  +  2  r  +  4 


'r(r  +  4"  +  i/-) 


7'+l  ■/■  +  2l' 


«"(/•  + 4" +  i/^) 


7-+1  r  +  2 


rt(r  +  2"  +  i'') 


r  + 1  1/ 


r+I  /•  +  2  1' 
s^(r  +  4'  +  .'^) 


•+l(2/-  +  4  +  i/') 


r+1 


a(v  +  2"  +  i'') 
r  +  2  +  ^^ 


a(r  +  2  +)'-)         a(r  +  2''  +  i'') 


;■+!  r  +  2 


a  (7-  + 2"  +  .'-) 


a  (/•  +  2'  +  i'') 
|i.  log  F 


cv 


■^7-  loo-  I 

dr     ^ 


r+l 


a{r  +  -2'  +  i>-) 
r  +  2  +  u' 


|.logF 


as  the  Hessian  of  — L,  when 


('    sm 


edd. 


The  ratios  of  the  minors  of  this  determinant  to  the  value  of  the  determinant  give 
the  standard  deviations  and  correlations  of  the  optimum  values  of  the  four  parameters 
obtained  from  a  number  of  large  samples. 

In  discussing  the  efficiency  of  the  method  of  moments  in  respect  of  the  fm-m  of  the 
curve,  it  is  doubtful  if  it  be  possible  to  isolate  in  a  unique  and  natural  manner,  as  we 
have  done  in  respect  of  location  and  scaling,  a  series  of  parameters  which  shall  successively 
represent  different  aspects  of  the  process  of  curve  fitting.  Thus  we  might  find  the 
efficiencies  with  which  r  and  i-  are  determined  by  the  method  of  moments,  or  those  of 
the  parametric  functions  corresponding  to  /^,  and  f-i.,,  or  we  might  use  m^  and  ni.^  as 
independent  parameters  of  form  ;  but  in  all  these  cases  we  should  be  employing  an 
arbitrary  pair  of  measures  to  indicate  the  relative  magnitude  of  corresponding  contour 
ellipses  of  the  two  frecjuency  surfaces. 

For  the  symmetrical  series  of  curves,  the  Types  II.  and  VII..  the  two  systems  of 
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ellipses  are  coaxial,  the  deviations  of  r  and  i-  being  uncorrelated  ;  in  the  case  of  Type  VII . 
we  put  i-  =  0,  in  the  determinant  given  above,  which  then  becomes 


r+1 r+2 
r+4 

0 

r+l 

■r  +  2 

0 


r+l 
r  +  2 


2r+l 
r  +  4 


0 
1 


0     y 


r  +  2 
0 


-r  +  2 


0    MH^)-Ki)l 


and  falls  in  the  two  factors 

-2r  +  4  I 
so  that 


rtLhl':^)-,t!:\\-^ 


7  / 


r+lr  +  2^  ir  \  _  r+I" 


2r  +  4        \2/      ,^.  ^  2' 


na-{   = 


2  7-  +  2 


:■!  .     r 


r  +  2   F  [-]-2r+l r  +  4 


and 


4  r+ 1  r+2 


— ^'r(^)-Ki)j-^ 


+  4 


The  corresponding  expressions  for  the  method  of  moments  are 


no-,, 


3    r-r-2'(7--  +  r+10) 
8  ■      7^1  r^  r^5 


and 


.,       2    rr-rr-3(r-'-r+ 18) 
■J  r— 5  r— / 


Since  for  moderately  large  values  of  r,  we  have,  approximately, 


and 


1     /r"\ 16/  1      '< 

r  +  2'F(-)-2r+lr  +  4=-(l-^^^;, 


r+]    r  +  2^JF(^j-Fg)}-2.+  lr  +  4  =  G-=:- 
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or,  when  r  is  great, 


{r  +  ^'+l  r  +  '2  .,    )  r-l  ?--3  r-5 
(r'  + r+ 10)  r-r^' 

^      28-8. 


and  for  the  efficiency  of  r^, 


(/•  +  2'  +f  ■■■  )r+rr-5?'-7. 
(r--r+18)7-'r^'^^ 


or,  when  r  is  great. 


53-3 


The  following  table  gives  the  values  of  the  transcendental  quantities  required,  and 
the  efficiency  of  the  method  of  moments  in  estimating  the  value  of  v  and  r  from  samples 
drawn  from  Type  VII.  distribution. 


r. 

-  ir  +  1  )■  +  4. 

Efficiency 

of  1'^. 

r+V  r  +  2' 
-2r+l  J-  +  4. 

Efficiency 
of  r^. 

5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 

5-31271 
5-31736 
5-32060 
5-32296 
5-32472 
5-32607 
5-32713 
5-32797 
5-32866 
5-32919 

0 

0-2572 

0-4338 

0-5569 

0-6449 

0-7097 

0-7586 

0-7963 

0-8259 

0-8497 

5-9473 
5-9574 
5-9649 
5-970G 
5-9750 
5-9787 
5-9810 
5-9839 
5-9853 
5-9870 
5-9883 
5-9895 

0 

0-1687 

0-3130 

0-4403 

0-5207 

0-5935 

0-6519 

0-6990 

0-7376 

0-7694 

0-7959 

0-8182 

It  will  be  seen  that  we  do  not  attain  to  80  per  cent,  efficiency  in  estimating  the  form 
of  the  curve  until  /  is  about  17-2,  which  corresponds  to  fi..^  3-42.  Even  for  sym- 
metrical curves  higher  values  of  ft.,  imply  that  the  method  of  moments  makes  use  of 
less  than  four-fifths  of  the  information  supplied  by  the  sample. 
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On  the  other  side  of  the  normal  point,  among  the  Type  II.  curves,  very  similar  formulae 
apply.     The  fundamental  Hessian  is 


r-lr-2 
7- -4 


r-1 

r-2 


2r-l 
r-4 


'.^  0  *F^' 

0--2 


o 


0        iO(^VF/?::iiM 


r-2  '  L     \    2 

where  /  is  written  for  the  positive  quantity.  -  r,  whence 


,'^n 


■lla-f  = 


2  r-2-' 


•-2    F 


r-2 


-2  r-1  r-4 


and 


Now  since 


it  follows  that 


4  r-1 r-2 


na-iT  = 


7^^\,{^)-,l^-,^ 


f  ,t^]  =  f  "-i'-'    * 


2    /       r-2" 


'r-2    F 


3 , / r— J 


'/•-4' 


-2r-L  /•-4  =  r-S  F     — ^  )  -2/-/'-3, 


which  is  the  same  function  of  /•— 4  as 


-ji  _  /'■ 


'r-2    F  '^-)  -2r+  1  /•  +  4 


is  of  r. 

In  a  similar  manner 


•+1 


— ^^C^UfC^)} 


-2  r- I  r-4 


=  r-rr-2'O 


r-4      ,  r-3 


— |-2._2r+l, 


which  is  the  same  function  of  /•  — 3  as 


■+rr  +  2"-iF 


''/•-1\      .  I) 


is  of  /'. 


^Vinrj-^y]-^'-^^'-,, 
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In  all  these  functions  and  those  of  the  foUowmg  table,  r  must  be  substituted  as  a 
positive  quantity,  although  it  must  not  be  forgotten  that  r  changes  sign  as  we  pass  from 
Type  VII.  to  Type  II.,  and  we  have  hitherto  adhered  to  the  convention  that  r  is  to 
be  taken  positive  for  Type  VII.  and  negative  for  Type  II. 


-  2/-  -  1  r  -  4. 

r-l"r-2' 

r. 

Efficiency 
of  .V- 

--2r-lr-4. 

Efficiency 
of  r^. 

2 

4 

0 

4 

0 

3 

4-93480 

0-0576 

5-1595 

0-0431 

4 

5-15947 

0-2056 

5-5648 

0-1445 

5 

5-23966 

0-3590 

5-7410 

0-2613 

6 

5-27578 

0-4865 

5-8305 

0-3708 

7 

5-29472 

0-5857 

5-8813 

0-4653 

8 

5-30576 

0-6615 

5-9126 

0-5441 

9 

5-31271 

0-7198 

5-9331 

0-6090 

10 

5-31736 

0-7650 

5-9473 

0-6624 

11 

5-32060 

0-8005 

5-9574 

0-7063 

12 

5-32296 

0-8287 

5-9649 

0-7427 

13 

5-32472 

0-8516 

5-9706 

0-7731 

14 

5-32607 

0-8702 

5-9750 

0-7986 

15 

5-9787 

0-8202 

In  both  cases  the  region  of  validity  is  bounded  by  the  rectangle,  at  the  point  B 
(fig.  2,  p.  343).  Efficiency  of  80  per  cent,  is  reached  when  r  is  about  14- 1  {^.,  ==  2-65). 
Thus  for  symmetrical  curves  of  the  Pearsonian  type  we  may  say  that  the  method  of 
moments  has  an  efficiency  of  80  per  cent,  or  more,  when  fi.,  lies  between  2-65  and  3-42. 
The  limits  within  which  the  values  of  the  parameters  obtained  by  moments  cannot  be 
greatly  improved  are  thus  much  narrower  than  has  been  imagined. 


11.  The  Reason  for  the  Efficiency  of  the  Method  of  Moments  in  a  Small 
Region  surrounding  the  Normal  Curve. 

We  have  seen  that  the  method  of  moments  applied  in  fitting  Pearsonian  cm-ves  has 
an  efficiency  exceeding  80  per  cent,  only  in  the  restricted  region  for  which  /3.,  lies  between 
the  limits  2-65  and  3-42,  and  as  we  have  seen  in  Section  8,  for  which  /3,  does  not  exceed 
0-1.  The  contours  of  equal  efficiency  are  nearly  circular  or  elliptical  within  these 
limits,  if  the  curves  are  represented  as  in  fig.  2,  p.  343,  and  are  ultimately  centred  round 
the  normal  point,  at  which  point  the  efficiencies  of  all  parameters  tend  to  100  per  cent. 
It  was,  of  course,  to  be  expected  that  the  first  two  moments  would  have  100  per  cent, 
efficiencies  at  this  point,  for  they  happen  to  be  the  optimum  statistics  for  fitting 
the  normal  curve.  That  the  moment  coefficients  ^,  and  /3,  also  tend  to  100  per  cent, 
efficiency  in  this  region  suggests  that  in  the  immediate  neighbourhood  of  the  normal 
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curve  the  departures  from  normality  specified  by  the  Pearsonian  formula  agree  with 
those  of  that  system  of  curves  for  which  the  method  of  moments  gives  the  solution  of 
maximum  likelihood. 

The  system  of  curves  for  which  the  method  of  moments  is  the  best  method  of  fitting 
may  easily  be  deduced,  for  if  the  frequency  in  the  range  dx  be 

then 

^  1 
?.9     ^'' 

must  involve  x  only  as  polynomials  up  to  the  fourth  degree  ;   consequently 

y  —  ^  ' 

the  convergence  of  the  probability  integral  requiring  that  the  coefficient  of  ic'  should  be 
negative,  and  the  five  quantities  a,  pi,  f^,  ps,  Pi  being  connected  by  a  single  relation, 
representing  the  fact  that  the  total  probability  is  unity. 

Typically  these  curves  are  bimodal,  and  except  in  the  neighbourhood  of  the  normal 
point  are  of  a  very  different  character  from  the  Pearsonian  curves.  Near  this  point, 
however,  they  may  be  shown  to  agree  with  the  Pearsonian  type  ;  for  let 

represent  a  curve  of  the  quartic  exponent,  sufficiently  near  to  the  normal  curve  for  the 
squares  of  k^  and  k.^  to  be  neglected,  then 

-—  log  //  = , ,1  -  .^/;, 4 fc  - 

rt.'f  rr'  \  (T  (T   ; 

X 


neglecting  powers  of  k^  and  h.  Since  the  only  terms  in  the  denominator  constitute  a 
quadratic  in  x,  the  curve  satisfies  the  fundamental  equation  of  the  Pearsonian  type  of 
curves.  In  the  neighbourhood  of  the  normal  point,  therefore,  the  Pearsonian  curves 
are  equivalent  to  curves  of  the  quartic  exponent ;  it  is  to  this  that  the  efficiency  of  />(.; 
and  fj.,.  in  the  neighbourhood  of  the  normal  curve,  is  to  be  ascribed. 

12.  Discontinuous  Distributions. 

The  applications  hitherto  made  of  the  optimum  statistics  have  been  problems  in 
which  the  data  are  ungrouped,  or  at  least  in  which  the  grouping  intervals  are  so  small 
as  not  to  disturb  the  values  of  the  derived  statistics.     By  grouping,  these  continuous 
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distributions  are  reduced  to  discontinuous  distributions,  and  in  an  exact  discussion  must 
be  treated  as  such. 

If  fs  be  the  probability  of  an  observation  falling  in  the  cell  (.s),  ji,  being  a  function  of 
the  required  parameters  f),,  R, ...  ;  and  in  a  sample  of  N,  if  n,  arc  found  to  fall  into 
that  cell,  then 

If  now  we  write  /F,  =  p,N,  we  may  conveniently  put 

L  =  s(n.loff^^), 

where  L  differs  by  a  constant  only  from  the  logarithm  of  the  likelihood,  with  sign 
reversed,  and  therefore  the  method  of  the  optimum  will  consist  in  finding  the  minhnmn 
value  of  L.     The  equations  so  found  are  of  the  form 

3L  =  _sf^?^'Uo (6) 

It  is  of  interest  to  compare  these  formulae  with  those  obtained  by  making  the  Pearsonian 
•^  a  minimum. 
For 

and  therefore 

so  that  on  differentiating  by  dQ,  the  condition  that  ^^  should  be  a  minimum  for  variations 
of  e  is 

'7%:  dn\  _ 


-^{^■^"^    =0 (7) 

Equation  (7)  has  actually  been  used  (12)  to  "  improve  "  the  values  obtained  by  the 
method  of  moments,  even  in  cases  of  normal  distribution,  and  the  Poisson  series,  where 
the  method  of  moments  gives  a  strictly  sufficient  solution.  The  discrepancy  between 
these  two  methods  arises  from  the  fact  that  x"  is  itself  an  approximation,  applicable 
only  when  Ti,  and  n,  are  large,  and  the  difference  between  them  of  a  lower  order  of 
magnitude.     In  such  cases 

L  =  s(u.log^)  =  s(.T^log'ii±^)  =  sjx+|l--^,...|. 
\        "n,/  V  ^     m    I  I        -Im      6m'      J 

and  since 

S  (x)  =  0, 

VOL.   COXXII. A.  3    D 
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we  have,  when  x  is  in  all  cases  small  compared  to  m, 

\m/ 

as  a  first  approximation.  In  those  cases,  therefore,  when  x^  is  a  valid  measure  of  the 
departure  of  the  sample  from  expectation,  it  is  equal  to  2L  ;  in  other  cases  the  approxi- 
mation fails  and  L  itself  must  be  used. 

The  failure  of  equation  (7)  in  the  general  problem  of  finding  the  best  values  for  the 
parameters  may  also  be  seen  by  considering  cases  of  fine  grouping,  in  which  the  majority 
of  observations  are  separated  into  units.     For  the  formula  in  equation  (6)  is  equivalent  to 

where  the  summation  is  taken  over  all  the  observations,  while  the  formula  of 
equation  (7),  since  it  involves  «/,  changes  its  value  discontinuously,  when  one 
observation  is  gradually  increased,  at  the  point  where  it  happens  to  coincide  with  a 
second  observation. 

Logically  it  would  seem  to  be  a  necessity  that  that  population  which  is  chosen  in 
fitting  a  hypothetical  population  to  data  should  also  appear  the  best  when  tested  for 
its  goodness  of  fit.  The  method  of  the  optimum  secures  this  agreement,  and  at  the 
same  time  provides  an  extension  of  the  process  of  testing  goodness  of  fit,  to  those  cases 
for  which  the  y^  test  is  invalid. 

The  practical  value  of  ^^  lies  in  the  fact  that  when  the  conditions  are  satisfied  in 
order  that  it  shall  closely  approximate  to  2L,  it  is  possible  to  give  a  general  formula 
for  its  distribution,  so  that  it  is  possible  to  calculate  the  probability,  P,  that  in  a  random 
sample  from  the  population  considered,  a  worse  fit  should  be  obtained  ;  in  such  cases 
X^  is  distributed  in  a  curve  of  the  Pearsonian  Type  III., 

v'-:t 

.If  ..(£y  ft  ill 

or 

where  n'  is  one  more  than  the  number  of  degrees  of  freedom  in  which  the  sample  may 
differ  from  expectation  (17). 

In  other  cases  we  are  at  present  faced  with  the  difficulty  that  the  distribution  L 
requires  a-  special  investigation.  This  distribution  will  in  general  be  discontinuous  (as 
is  that  of  x^),  but  it  is  not  impossible  that  mathematical  research  will  reveal  the  existence 
of  effective  graduations  for  the  most  important  groups  of  cases  to  which  x'^  caimot 
be  applied. 
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We  shall   conclude  with  a  few  illustrations  of  important  types  of  discontinuous 
distribution. 

1.  The  Poisson  Series. 

nr  ■m''-         \ 

2 !  x\         I 

involves  only  the  single  parameter,  and  is  of  great  importance  in  modern  statistics. 
For  the  optimum  value  of  m, 

s{A(_,„,+,,iog.H,)}  =  o, 

whence 

s(|^-i)  =  o, 

or 

m  =  X. 

The  most  likely  value  of  m  is  therefore  found  by  taking  the  first  moment  of  the  series. 
Differentiating  a  second  time, 

L  =  s  /  _  A  \  =  -  il , 

so  that 

m 

o-A   —  —> 
n 

as  is  well  known. 

2.  Grouped  Normal  Data. 

In  the  case  of  the  normal  curve  of  distribution  it  is  evident  that  the  second  moment 
is  a  sufficient  statistic  for  estimating  the  standard  deviation  ;  in  investigating  a  sufficient 
solution  for  grouped  normal  data,  we  are  therefore  in  reality  finding  the  optimum 
correction  for  grouping  ;  the  Sheppard  correction  having  been  proved  only  to  satisfy 
the  criterion  of  consistency. 

For  grouped  normal  data  we  have 

p,  =  —j^  e     --  dx, 

and  the  optimum  values  of  hi  and  o-  are  obtained  from  the  equations, 


dm  ^,/)j  cm  I 


aL 

3  D  2 
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z  — 


we  have  the  two  conditions. 


and 


n. 


S^z, 


"s+i 


f5  i  —   —  2s ^2,+,  ir  =  0. 


'// 


0-1 

1-2 

2-3 

3-4 

4-5 

5-6 

6-7 

7-8 

8-9 

114 

84 

53 

24 

14 

6 

3 

1 

1 

As  a  simple  example  we  shall  take  the  case  chosen  by  K.  Smith  in  her  investigation  of 
the  variation  of  x'  in  the  neighbourhood  of  the  moment  solution  (12). 

Three  hundred  errors  in  right  ascension  are  grouped  in  nine  classes,  positive  and 
negative  errors  being  throAvn  together  as  shown  in  the  following  table  : — 

0"-l  arc 
Frequency     . 

The  second  moment,  without  correction,  yields  the  value 

0-,,  =  2-282542. 
Using  Sheppard's  correction,  we  have 

,7,  =  2-264214, 
while  the  value  obtained  by  making  yj  a  minimum  is 

o-,.  =  2-355860. 

If  the  latter  value  were  accepted  we  should  have  to  conclude  that  Sheppard's  correc- 
tion, even  when  it  is  small,  and  applied  to  normal  data,  might  be  altogether  of  the 
wrong  magnitude,  and  even  in  the  wrong  direction.     In  order  to  obtain  the  optimum 

value  of  0",  we  tabulate  the  values  of  ^:—  in  the  region  under  consideration ;  this  may 

ro- 
be done  without  great  labour  if  values  of  o-  be  chosen  suitable  for  the  direct  application 
of  the  table  of  the  probability  integral  (13,  Table  II.).     We  then  have  the  followmg 
values  :— 


1 

0-43 

0-44 

0-45 

«t 

0-46 

8L 

CO- 

+  15-135 

+  2-149 

-ir098 

-24-605 

0(7 

-0-261 

-   0-260 
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By  interpolation, 

I  =0-441624 

(7 

a  =  2-26437. 

We  may  therefore  summarise  these  results  as  follows  : — 

Uncorrected  estimate  of  a- 2  •  28254 

Sheppard's  correction —0-01833 

Correction  for  maximum  likelihood —0-01817 

"  Correction  "  for  minimum  x^ -|-0- 07332 

Far  from  shaking  our  faith,  therefore,  in  the  adequacy  of  Sheppard's  correction, 
when  small,  for  normal  data,  this  example  provides  a  striking  instance  of  its  effective- 
ness, while  the  approximate  nature  of  the  x"  test  renders  it  unsuitable  for  improving  a 
method  which  is  already  very  accurate. 

It  will  be  useful  before  leaving  the  subject  of  grouped  normal  data  to  calculate  the 
actual  loss  of  efficiency  caused  by  grouping,  and  the  additional  loss  due  to  the  small 
discrepancy  between  moments  with  Sheppard's  correction  and  the  optimum  solution. 

To  calculate  the  loss  of  efficiency  involved  in  the  process  of  grouping  normal  data,  let 

a  h-ha 
when  aa-  is  the  group  interval,  then 

.•  =/(f )  +f;.r  (f)  +  ^/"  (tH  ^j"  (f )  +  ... 

=  /(f)i'  +  ;;^(f-l)+-f4(f'-«r  +  3)+^(f'-lof'  +  45r-15)  +  . 
whence 

log  -  =  iog/+ |(r-i)  -  ^(r+4f-2) + ^(f +of.H-3r-.)- 

and 

— — .,  log  r  = -  +  —A (35-  + 2)  + (of*+12f-+l)  -  ...  f  , 

of  which  the  mean  value  is 

(7- 1        12       144      4320       J' 
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neglecting  the  periodic  terms ;  and  consequently 

"^   -,.l.    +r2"2880-' 
Now  for  the  mean  of  ungrouped  data 


a  a 

"■»!    —  — 

n 


so  that  the  loss  of  efficiency  due  to  grouping  is  nearly  — . 

The  further  loss  caused  by  using  the  mean  of  the  grouped  data  is  very  small,  for 


.  _  1/.  _  (t"  / ,      a"  \ 
neglecting  the  periodic  terms  ;  the  loss  of  efficiency  by  using  I'l  therefore  is  only 


2880 
Similarly  for  the  efficiency  for  scaling, 

|llog.  =  i-M:  +  l-|^(lOf-3)-^(9fH2lf-5) 

0(7  or  a-  a    [IZ  6o0 

of  which  the  mean  value  is 


6       40      270       129,(i00 
neglecting  the  periodic  terms  ;  and  consequently 


'   ~  2n\        G       360      10,800 '■■] 


For  ungrouped  data 


o-^i 


2n ' 


so  that  the  loss  of  efficiency  in  scaling  due  to  grouping  is  nearly  — .     This  may  be  made 

6 
as  low  as  1  per  cent,  by  keeping  a  less  than  J. 

The  further  loss  of  efficiency  produced  by  using  the  grouped  second  moment  with 

Sheppard's  correction  is  again  very  small,  for 


:;  _    I'i  —  f-j'  'Zar    I  -,    .    a'    ,       « 


n  ?(.   V        6       360 


neglecting  the  periodic  terms. 
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Whence  it  appears  that  the  further  loss  of  efficiency  is  only 


a° 


10,800 


We  may  conclude,  therefore,  that  the  high  agreement  between  the  optimum  value  of 
o-  and  that  obtained  by  Sheppard's  correction  in  the  above  example  is  characteristic 
of  grouped  normal  data.  The  method  of  moments  with  Sheppard's  correction  is  highly 
efficient  in  treating  such  material,  the  gain  in  efficiency  obtainable  by  increasing  the 
likelihood  to  its  maximum  value  is  triffing,  and  far  less  than  can  usually  be  gained  by 
using  finer  groups.  The  loss  of  efficiency  involved  in  grouping  may  be  kept  below 
1  per  cent,  by  making  the  group  interval  less  than  one-quarter  of  the  standard  deviation. 

Although  for  the  normal  curve  the  loss  of  efficiency  due  to  moderate  grouping  is  very 
small,  such  is  not  the  case  with  curves  making  a  finite  angle  with  the  axis,  or  having  at 
an  extreme  a  finite  or  infinitely  great  ordinate.  In  such  cases  even  moderate  grouping 
may  result  in  throwing  away  the  greater  part  of  the  information  which  the  sample 
provides. 

3.  Distribidion  of  Observations  in  a  Dilution  Series. 

An  important  type  of  discontinuous  distribution  occurs  in  the  application  of  the 
dilution  method  to  the  estimation  of  the  number  of  micro-organisms  in  a  sample  of 
water  or  of  soil.  The  method  here  presented  was  originally  developed  in  connection 
with  Mr.  Cutler's  extensive  counts  of  soil  protozoa  carried  out  in  the  protozoological 
laboratory  at  Rothamsted,  and  although  the  method  is  of  very  wide  application,  this 
particular  investigation  affords  an  admirable  example  of  the  statistical  principles 
involved. 

In  principle  the  method  consists  in  making  a  series  of  dilutions  of  the  soil  sample, 
and  determining  the  presence  or  absence  of  each  type  of  protozoa  in  a  cubic  centimetre 
of  the  dilution,  after  incubation  in  a  nutrient  medium. 

The  series  in  use  proceeds  by  powers  of  2,  so  that  the  frequency  of  protozoa  in  each 
dilution  is  one-half  that  in  the  last. 

The  frequency  at  any  stage  of  the  process  may  then  be  represented  by 

n 
2-^' 

when  X  indicates  the  number  of  dilutions. 

Under  conditions  of  random  sampling,  the  chance  of  any  plate  receiving  0,  1,  2,  3 
protozoa  of  a  given  species  is  given  by  the  Poisson  series 
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and  in  consequence  the  proportion  of  sterile  plates  is 

JO  —  e~"', 
and  of  fertile  plates 

q  =  1-e-'". 

In  general  we  may  consider  a  dilution  series  with  dilution  factor  a  so  that 

log  p  = , 

and  assume  that  s  plates  are  poured  from  each  dilution. 

The  object  of  the  method  being  to  estimate  the  number  n  from  a  record  of  the  sterile 
and  fertile  plates,  we  have 

L  =  S,  (logjij)  +S,(log  (]) 

when  Si  stands  for  summation  over  the  sterile  plates,  and  S2  for  summation  over  those 
which  are  fertile. 
Now 


Bp 


vq  , 


3  log  n  3  log  n 

so  that  the  optimum  value  of  n  is  obtained  from  the  equation, 


3L 


=  S,(log^)-Sj^logp)  =  0. 


3  log  II  V    &  -f'/        -  yg 

Differentiating  a  second  time, 

.rP~.  =  S.  (logp)  -S.,|£^i^^(log  P+1+  ^M^)|  ; 
3  (log  n)-  ^    ^  J-i        -\       „       X'r,  I-  q      n 


now  the  mean  number  of  sterile  plates  is  p.s,  and  of  fertih-  plates  gs,  so  that  the  mean 

value  of  — -; -,  IS 

3  (log  a)- 

5 =  sS  Ip  log  'p—p  log  p  (log  J0+  1  +^log  p)  \  =  — sS  "^^(log  p)  '■  , 

"■   logn  I  ?  '  t-?  -> 

the  summation,  S,  being  extended  over  all  the  dilutions. 

It  thus  appears  that  each  plate  observed  adds  to  the  weight  of  the  determination 
of  log  n  a  quantity 

w  =  |(logp)-. 
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We  give  below  a  table  of  the  values  of  p,  and  of  w,  for  the  dilution  series  log  p  —  2' 
from  X  ^  —4  to  x  ^  11. 


'  X. 

P- 

w. 

S  (w)  (per  cent.). 

-4 

0-00000014167 

0-000036 

0-002 

-3 

0-0003354626 

0-021477 

0-906 

2 

0-01831564 

0-298518 

13-485 

-1 

0-1353353 

0-626071 

39-865 

0 

0-3678794 

0-581977 

64-388 

1 

0-6065306 

0-385355 

80-625 

2 

0-7788009 

0-220051 

89-897 

3 

0-8824969 

0-117350 

94-842 

4 

0-9394110 

0-060565 

97-394 

5 

0-9692333 

0-030764 

98-690 

6 

0-9844965 

0-015-503 

99-343 

7 

0-9922179 

0-007782 

99-671 

8 

0-9961014 

0-003899 

99-836 

9 

0-9980488 

0-001951 

99-918 

10 

0-9990239 

0-000976 

99-959 

11 

0-9995119 

0-000488 

99-979 

Remainder 
Total   .  . 

0-000488 

2-373251 

For  the  same  dilution  constant  the  total  S  (w)  is  nearly  independent  of  the  particular 
series  chosen.     Its  average  value  being 


— j ,  or  in  this  case  2-373138.     The  fourth 

b  log  a. 

column  shows  the  total  weight  attained  at  any  stage,  expressed  as  a  percentage  of  that 

obtained  from  an  infinite  series  of  dilutions.     It  will  be  seen  that  a  set  of  eight  dilutions 

comprise  all  but  about  2  per  cent,  of  the  weight.     With  a  loss  of  efficiency  of  only  2  to 

2h  per  cent.,  therefore,  the  number  of  dilutions  which  give  information  as  to  a  particular 

species  may  be  confined  to  eight.     To  this  number  must  be  added  a  number  depending 

on  the  range  which  it  is  desired  to  explore.     Thus  to  explore  a  range  from  100  to  100,000 

per  gramme  (about  10  octaves)  we  shoitld  require  10  more  dilutions,  making  18  in  all, 

while  to  explore  a  range  of  a  millionfold,  or  about  20  octaves,  28  dilutions  would  be 

needed. 

In  practice  it  would  be  exceedingly  laborious  to  calculate  the  optimum  value  of  n  for 

each  series  observed  (of  which  38  are  made  daily).     On  the  advice  of  the  statistical 

department,  therefore,  Mr.  Cutler  adopted  the  plan  of  counting  the  total  number  of 

sterile  plates,  and  taking  the  value  of  n  which  on  the  average  would  give  that  number. 

When  a  sufficient  number  of  dilutions  are  made,  log  n  is  diminished  bv  -  log  a  for  each 

.s- 

additional  sterile  plate,  and  even  near  the  ends  of  the  series  the  appropriate  values  of 
n  may  easily  be  tabulated.  Since  this  method  of  estimation  is  of  wide  application, 
and  appears  at  first  sight  to  be  a  very  rough  one,  it  is  important  to  calculate  its  efficiency. 
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For  any  dilution  the  variance  in  the  number  of  sterile  plates  is 

spq, 
and  as  the  several  dilutions  represent  independent  samples,  the  total  variance  is 

sS  {-pq), 
hence 

O 

Now  S  ipq)  has  an  averacre  value  ,— ^=-^  ,  therefore  taking  a  =  2, 
■'  log  a 

{Yogaf  =  -480453, 

and 

S(m)  =  1 

being  very  nearly  constant  and  within  a  small  fraction  of  unity  ;   whence  the  efficiency 
of  the  method  of  counting  the  sterile  plates  is 

-^— =  8771  per  cent., 

TT-  log  2 

a  remarkably  high  efficiency,  considering  the  simplicity  of  the  method,  the  elficiency 
being  independent  of  the  dilution  ratio. 

13.  Summary. 

During  the  rapid  development  of  practical  statistics  in  the  past  few  decades,  the 
theoretical  foundations  of  the  subject  have  been  involved  in  great  obscurity.  Adequate 
distinction  has  seldom  been  drawn  between  the  sample  recorded  and  the  hypothetical 
population  from  which  it  is  regarded  as  drawn.  This  obscurity  is  centred  in  the  so-called 
"  inverse  "  methods. 

On  the  bases  that  the  purpose  of  the  statistical  reduction  of  data  is  to  obtain  statistics 
which  shall  contain  as  much  as  possible,  ideally  the  whole,  of  the  relevant  information 
contained  in  the  sample,  and  that  the  function  of  Theoretical  Statistics  is  to  show  how 
such  adequate  statistics  may  be  calculated,  and  how  much  and  of  what  kind  is  the 
information  contained  in  them,  an  attempt  is  made  to  formulate  distinctly  the  types 
of  problems  which  arise  in  statistical  practice. 

Of  these,  problems  of  Specification  are  found  to  be  dominated  by  considerations  which 
may  change  rapidly  during  the  progress  of  Statistical  Science.  In  problems  of  Distri- 
bution relatively  little  progress  has  hitherto  been  made,  these  problems  still  affording 
a  field  for  valuable  enquiry  by  highly  trained  mathematicians.  The  principal  purpose 
of  this  paper  is  to  put  forward  a  general  solution  of  problems  of  Estimation. 
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Of  the  criteria  used  in  problems  of  Estimation  only  the  criterion  of  Consistency  has 
hitherto  been  widely  applied  ;  in  Section  5  are  given  examples  of  the  adequate  and 
inadequate  application  of  this  criterion.  The  criterion  of  Efficiency  is  shown  to  be  a 
special  but  important  case  of  the  criterion  of  Sufficiency,  which  latter  requires  that  the 
whole  of  the  relevant  information  supplied  by  a  sample  shall  be  contained  in  the  statistics 
calculated. 

In  order  to  make  clear  the  nature  of  the  general  method  of  satisfying  the  criterion 
of  Sufficiency,  which  is  here  put  forward,  it  has  been  thought  necessary  to  reconsider 
B ayes'  problem  in  the  light  of  the  more  recent  criticisms  to  which  the  idea  of  "  inverse 
probability  "  has  been  exposed.  The  conclusion  is  drawn  that  two  radically  distinct 
concepts,  both  of  importance  in  influencing  our  judgment,  have  been  confused  under 
the  single  name  of  probabilUy.  It  is  proposed  to  use  the  term  likelihood  to  designate 
the  state  of  our  information  with  respect  to  the  parameters  of  hypothetical  populations, 
and  it  is  shown  that  the  quantitative  measure  of  likelihood  does  not  obey  the  mathe- 
matical laws  of  probability. 

A  proof  is  given  in  Section  7  that  the  criterion  of  Sufficiency  is  satisfied  by  that  set 
of  values  for  the  parameters  of  which  the  likelihood  is  a  maximum,  and  that  the  same 
function  may  be  used  to  calculate  the  efficiency  of  any  other  statistics,  or,  in  other 
words,  the  percentage  of  the  total  available  information  which  is  made  use  of  by  such 
statistics. 

This  quantitative  treatment  of  the  information  supplied  by  a  sample  is  illustrated  by 
an  investigation  of  the  efficiency  of  the  method  of  moments  in  fitting  the  Pearsonian 
curves  of  Type  III. 

Section  9  treats  of  the  location  and  scaling  of  Error  Curves  in  general,  and  contains 
definitions  and  illustrations  of  the  intrinsic  accuracy,  and  of  the  centre  of  location  of  such 
curves. 

In  Section  10  the  efficiency  of  the  method  of  moments  in  fitting  the  general  Pearsonian 
curves  is  tested  and  discussed.  High  efficiency  is  only  found  in  the  neighbourhood  of 
the  normal  point.  The  two  causes  of  failure  of  the  method  of  moments  in  locating  these 
curves  are  discussed  and  illustrated.  The  special  cause  is  discovered  for  the  high 
efficiency  of  the  third  and  fourth  moments  in  the  neighbourhood  of  the  normal  point. 

It  is  to  be  understood  that  the  low  efficiency  of  the  moments  of  a  sample  in  estimating 
the  form  of  these  curves  does  not  at  all  diminish  the  value  of  the  notation  of  moments  as 
a  means  of  the  comparative  specification  of  the  form  of  such  curves  as  have  finite  moment 
coefficients. 

Section  12  illustrates  the  application  of  the  method  of  maximum  lilcelihood  to  dis- 
continuous distributions.  The  Poisson  series  is  shown  to  be  sufficiently  fitted  by  the 
mean.  In  the  case  of  grouped  normal  data,  the  Sheppard  correction  of  the  crude 
moments  is  shown  to  have  a  very  high  efficiency,  as  compared  to  recent  attempts  to 
improve  such  fits  by  making  x"  a  minimum  ;  the  reason  being  that  x^  is  an  expression 
only  approximate  to  a  true  value  derivable  from  likelihood.     As  a  final  illustration  of 
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the  scope  of  the  uew  process,  the  theory  of  the  estimation  of  micro-organisms  by  the 
dilution  method  is  investigated. 

Finally  it  is  a  pleasure  to  thank  Miss  W.  A.  Mackenzie,  for  her  valuable  assistance 
in  the  preparation  of  the  diagrams. 
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Introduction. 

Since  the  first  half  of  the  nineteenth  century,  it  has  been  known  that  certain 
species  of  protozoa  will  grow  in  a  suitable  medium  inoculated  with  soil.  In  spite  of 
this,  however,  some  later  woi'kers  have  asserted  that  such  organisms  did  not  occur  in 
normal  soils,  while  others,  though  recognising  their  presence,  denied  the  existence  of 
active  forms. 

To  Martin  and  Lewin(28)  must  be  ascribed  the  distinction  of  first  showing  that 
amoebae  and  flagellates  are  present  in  field  soil  in  a  trophic  condition  ;  and  owing  to 
the  labours  of  numerous  investig:ators,  it  is  now  known  that  most  soils  contain  these 
organisms  ;  but  unfortunately  there  is  little  or  no  knowledge  as  to  their  behaviour  in 
natural  conditions. 

Earlier  work  (8)  showed,  however,  that  an  investigation  of  the  micro-population  of 
field  soil  would  yield  interesting  and  important  results,  especially  as  it  was  evident 
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that  certain  views  held  by  soil  biologists  required  modification  ;  thus  it  has  always 
been  thought  that  the  bacterial  numbers  in  the  soil  changed  very  slowly,  and 
Investigators  have  not  hesitated  to  take  counts  at  monthly  or  even  longer  intervals. 
This  assumption  is  incorrect,  since  the  numbers  of  both  protozoa  and  bacteria  vary 
enormously  from  day  to  day. 

These  large  changes  appeared  so  important,  both  from  the  point  of  view  of  general 
biology  and  that  of  soil  economy,  that  the  present  investigation  was  undertaken  with 
the  purpose  of  finding  in  what  way  the  changes  in  any  one  group  or  species  are  related 
to  those  in  another  ;  also  what  influence  external  conditions,  such  as  temperature, 
moisture  and  rainfall,  have  upon  the  soil  micro-organisms. 

The  method  adopted  was  to  make  systematic  counts  of  the  numbers  of  bacteria 
and  of  six  species  of  protozoa  in  soil  samples  taken  daily  direct  from  the  field,  and,  by 
statistical  methods,  to  correlate  these  counts  one  with  another  and  with  data  for 
external  conditions. 

Observations  at  shorter  periods  than  24  hours  could  not  be  made  owing  to  the 
labour  involved  ;  but  it  was  found  possible  to  organise  a  team  of  workers  so  that 
soil  samples  were  taken  at  the  same  hour  (10  o'clock)  for  365  consecutive  days 
(July  5,  1920-July  4,  1921).*  In  this  way  a  large  body  of  data  was  obtained, 
which  is  presented  in  the  following  pages.  It  is  recognised  that  with  so  much 
material  there  are  probable  interesting  deductions  to  be  made  which  have  been 
overlooked  by  us.  In  an  appendix,  therefore,  the  salient  figures  are  given  to  allow 
those  interested  to  apply  further  statistical  tests. 

Species  of  Protozoa. 

Of  the  species  of  protozoa  recorded,  the  following  six  (two  amcebae  and  four 
flagellates)  occurred  constantly  in  suflicient  numbers  to  admit  the  application  of 
statistical  methods  to  the  results  : — 

Dimastigamceha  gruheri  (Schardinger). — This  species  has  been  described  in  detail 
by  Wilson  (36)  under  the  name  of  Nwgleria  gruheri.  It  is  also  identical  with  the 
Amceba  punctata  of  Dangeard  (10)  and  Vahlkampjla  soli  of  Martin  and  Lewin  (27). 

Sj^ecies  a. — A  very  small  limax  amoeba,  which  has,  up  to  the  present,  not  been 
identified  with  any  species  previously  described.  Even  when  fully  extended  it 
rarely  exceeds  lO/x  in  length.  Its  movements  are  slow,  and  it  is  never  observed  to 
travel  far  in  any  one  direction.  More  frequently  short,  knobby  pseudo podia  are 
formed  first  at  one  point  and  then  at  another,  and  withdrawn  again  without  any 
progression  having  been  efiected.  In  the  hving  animal  the  protoplasm  is  refractile, 
and  does  not  appear  to  be  vacuolated  or  to  show  any  structure,  while  the  nucleus  is 
not  visible    and    no    contractile   vacuole    has   been  observed.     Staining   reveals    the 

*  Our  thanks  are  due  to  Dr.  M.  L.\i'rie  and  to  Mr.  1'.  II.  H.  Gray  for  their  vahiable  assistance  in 
the  examination  of  the  culture  plates. 
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nucleus  as  composed  of  a  karyosome  and  a  very  thin  nuclear  membrane,  with  no 
peripheral  chromatin.  Nuclear  division  is  promitotic,  resembling  that  described  for 
A.  Jroschi  (29),  and  therefore  corresponding  to  the  second  Umax  type  in  Glaser's{16) 
classification. 

The  cyst  is  large  in  proportion  to  the  size  of  the  active  form,  the  diameter  being 
about  8ju.  The  outer  wall  is  thick  and  has  numerous  pores,  with  the  result  that  the 
cyst  contents  have  a  pentagonal  or  polygonal  appearance  in  optical  section.  A  very 
thin  inner  wall  is  closely  applied  to  the  cyst  contents,  which  contain  a  large  refractile 
granule  similar  to  that  occurring  in  many  flagellate  cysts.  A  full  account  of  this 
species  will  be  published  later. 

Ileteromita  sp. — The  differentiation  between  the  various  species  of  Heteromita  is 
in  many  cases  not  easy,  and  owing  to  the  large  number  of  cultures  that  had  to  be 
examined  each  day,  it  has  not  been  attempted.  Much  the  commonest  form  is  one 
agreeing  closely  with  the  description  of  Bodo  repens  (Klebs,  22),  but  the  published 
descriptions  do  not  make  the  distinctions  between  this  species  and  Bodo  glohosus 
(PusCHKAEEW,  30)  at  all  clear,  and  possibly  both  species  have  been  present. 

Heteromita  lens  {Bodo  Jens,  Muller,  Heteromastix  lens,  Woodcock,  37)  occurred 
in  the  cultures,  and  was  also  recorded  under  the  name  Heteromita,  but  it  was  not 
sufficiently  numerous  to  affect  the  results. 

Species  y. — This  is  au  hitherto  uudescribed  species.  It  is  the  smallest  of 
the  soil  flagellates,  varying  from  8  /x,  to  6  /u,  in  length  and  from  2  /a  to  3  /a  in 
greatest  breadth.  The  body  is  slightly  curved  and  a  single  flagellum  arises  on  the 
concave  side  just  behind  the  anterior  end.  Its  movements  are  very  like  those  of 
Heteromita,  since  the  single  flagellum  adheres  to  the  glass  slide  and  is  trailed  quite 
passively,  while  the  hind  end  of  the  body  vibrates  rapidly  from  side  to  side.  The 
nucleus  is  near  the  anterior  end  and  no  contractile  vacuole  has  been  found.  A  further 
account  of  this  species  will  be  published  separately. 

Gercomonas  sp. — Both  C.  longicauda  and  C.  crassicauda  were  apparently  present, 
but  no  attempt  was  made  to  distinguish  between  them. 

Oicomonas  termo  (Ehbenb.). — This  is  probably  identical  with  Monas  termo,  found 
by  Martin  in  sick  soils.  Martin  (25),  however,  was  unable  to  find  a  contractile 
vacuole,  whereas  in  the  present  species  a  very  distinct  contractile  vacuole  is  often 
present  at  the  base  of  the  flagellum. 

Species  not  Counted  Daily. 

The  dilutions  employed  in  counting  were  chosen  to  give  the  maximum  accuracy  for 
the  most  numerous  species,  and  although  several  other  protozoa  were  recorded 
regularly  in  small  numbers.ithe  results  obtained  are  not  sufficiently  accurate  to  justify 
detailed  discussion.     These  less  numerous  species  are  the  followmg  : — 

Amceba  sp.— A  binucleate  amoeba  which  occurred  almost  every  day  in  numbers  up 
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to  fifty  per  gramme.  In  appearance  it  is  identical  with  A.  diploidea  (Hartmann  and 
Nagler,  18),  but  its  life-history  has  not  yet  been  completely  worked  out. 

Nuclearia  simplex  (Cienkowsky).— This  species  is  not  included  by  Cash  (2)  in  his 
list  of  British  fresh-water  Rhizopoda.  It  is  distinguished  from  N.  delicatula  by  its 
smaller  size,  the  diameter  of  both  active  and  cystic  forms  being  only  between  15  ju, 
and  25  ^i.  It  was  found  every  day,  and  at  times  its  numbers  rose  to  as  many  as  3,000 
per  gramme. 

Chlamydophrys  stercorea  (Cienk.). — A  much  less  numerous  form  than  the  preceding. 

Monas  sp. — This  species  occurred  constantly  in  small  numbers. 

Proleptomonas  /(Bcicola. — Described  by  Woodcock  (37)  as  a  coprozoic  species.  It 
occurred  rather  less  regularly  than  the  preceding  forms,  but  occasionally  in  consider- 
able numbers. 

Colpoda  cucullus  (0.  F,  M.). — Constantly  present,  but  rarely  exceeding  fifty  per 
gramme. 

Other  forms  recorded  occasionally  in  the  course  of  the  year  were  : — 

Rhizopoda. 
Amceba  glebce  (Dobell,  14). 

Amoebce  limicola  (Rhumbler,  29a). — Rarely  found. 

Hartmannella  hyalina  (Dangeard,  9). 

Amoeba  sp. — This  is  probably  Arachnula  impatiens  (Cienk.,  13).  Only  living 
forms  were  examined  and  these  resembled  Dobell's  description  very  closely  in 
their  habits  and  in  the  absence  of  any  apparent  nucleus. 

Di(p,ugia  sp.^ — This  would  probably  have  been  found  much  more  frequently  had  it 
been  practicable  to  incubate  the  plates  for  a  longer  period. 

Flagellata. 
Cop>romonas  suhtilis  (Dobell,  12),  Sjnromonas  angusta  (Kent),  Tetramitus 
rostratus  (Perty),  Tetramitus  sjjiralis  (Goodey,  17a),  Helhesimastix  fcecicola 
(Woobcock),  Phyllomitus  amijlojjliagus  (Klebs),  Ophidomonas  sp.,  Bodo  edax 
(Klebs)  has  been  obtained  from  the  same  plot  of  land,  but  was  not  observed  at  all 
during  the  course  of  the  count. 

Ciliata. 

Colpoda  steinii,  a  much  less  abundant  species  than  C.  cucullus,  Colpidium  colpoda, 
PUurotricha  sp.,  Gastrostyla  sp.,  Balantiophorus  sp.,  Enchelys  sp. 

Influence  of  External  Conditions  on  Protozoa  and  Bacteria. 

The  numbers  of  all  the  organisms  counted  show  large  fluctuations  of  two  kinds, 

viz.,  daily  and  seasonal.     The  size  of  the  changes  that  take  place  within  so  short  a 

time   as   24    hours    is    perhaps   the    most   surprising   fact   that  the    experiment    has 

revealed.      Thus  the  three  consecutive  samples  246,   247  and   248  give  58'0,   14-25 
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and  26'25  million  bacteria  respectively.  The  changes  exhibited  by  any  of  the 
species  of  protozoa  are  at  times  even  larger,  and  affect  the  active  and  encysted 
stages  apparently  quite  indifferently.  It  is,  in  fact,  not  uncommon  for  the  numbers 
of  any  species  to  vary  from  a  few  hundred  or  a  few  thousand  to  400,000  or  more 
withm  24  hours,  and  the  percentage  of  these  in  the  active  condition  may  jump  from 
0  to  nearly  100  in  a  similar  interval  [e.g.  Dimastigamceba,  samples  250,  251  ; 
Cereomonas,  samples  221,  222  ;  Oicomonas,  samples  245,  246  ;  etc.). 

In  spite  of  these  apparently  quite  erratic  daily  changes,  when  the  numbers  are 
averaged  over  a  considerable  period  {e.g.  fourteen  days),  marked  seasonal  changes 
became  apparent.  The  whole  soil  population  rises  to  a  maximum  at  the  end  of 
November,  and  falls  again  to  a  minimum  at  the  end  of  February  or  early  in  March. 
The  bacteria  then  rise  to  a  further  very  high  figure  at  the  end  of  June.  The 
protozoa  are  found  to  fall  into  two  well-marked  groups  in  respect  of  these  seasonal 
changes.  The  curves  for  Cereomonas  and  Dimastigamceba  (fig.  l)  run  practically 
parallel  to  one  another,  except  for  a  short  period  in  September,  1920,  and  another  in 
June,  1921.  The  curve  for  Heteromita  also  shows  a  considerable  similarity  to  these, 
but,  unfortunately,  no  great  reliance  can  be  placed  on  the  results  obtained  for  this 
organism,  owing  to  the  frequency  with  which  its  numbers  exceeded  the  maximum 
that  could  be  counted  with  the  dilutions  used.  The  curves  for  sp.  a,  sp.  y  and 
Oicomonas  show  similar  parallelism  to  one  another  (fig.  2).  Thus,  from  the  end  of 
December  till  the  final  sample,  the  parallelism  between  the  curves  for  sp.  a  and 
Oicomonas  only  breaks  down  at  a  single  point,  while  the  similarity  between  the 
curves  for  sp.  a  and  sp.  y  during  the  final  four  months  is  even  more  striking. 

The  various  physical,  chemical  and  biological  changes  which  might  be  expected  to 
account  for  these  daily  and  seasonal  changes  in  numbers  will  be  considered  below. 
In  attempting  to  establish  a  relationship  either  between  the  numbers  of  the  different 
organisms  or  between  any  one  of  them  and  any  of  the  external  factors,  the  method 
adopted  is  to  arrange  aU  the  data  in  the  form  of  a  contingency  table  (Table  II, 
p.  328).  Thus,  in  comparing  any  two  variants,  an  increase  in  either  is  denoted 
by  a  +  ■  sign  and  a  decrease  by  a  —  sign,      -j-  +  will  then  indicate  that  both  the 

variants  are  increasing  together, that  they  are  decreasing  together,  and  -j 

or 1-  that  they  are  changing  in  opposite  directions.     If  there  is  no  relationship 

between  the  two,  the  total  number  of  like  signs  ( +  +  added  to )  throughout 

the  year  should  be  equal  to  the  total  number  of  unlike  signs  {-\ added  to 1-)- 

Any  departure  from  equality  will  indicate  a  correlation  between  the  two. 

(1)  Influence  of  Soil  Moisture. 

It  has  sometimes  been  claimed  that  the  numbers  of  soil  organisms  are  very  closely 

linked  up  with  the  soil  moisture.     Thus  ENGBERDiNG(14a)  states  :  "  Im  gewachsenen 

Feldboden  steigt  und  fallt  die  Bakterienzahl  in  der  warmeren  Jahreszeit  mit  seinem 

Wassergehalte,"  and  France  (146)  has  recently  extended  this  dictum  to  the  whole  of 
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Fortnightly  averages  of  total  numbers  of  Hekronufa,  Oerconwnas  and  Dimastigamceba,  and  of  bacteria, 

moisture  and  temperature. 

the  soil  population.  The  present  short-interval  counts,  however,  lend  no  support  to 
this  assumption.  The  changes  in  moisture-content  of  the  soil  from  one  day  to  the 
next  are  invariably  small,  and  frequently  there  is  no  appreciable  change  for  several 
successive  days,  while  the  numbers  of  all  the  organisms  may  be  fluctuating 
enormously  {e.g.  samples  222,  223,  224,  225).  It  is  impossible,  therefore,  to  explain 
the  daily  changes  in  the  soil  population  by  changes  in  moisture,  and  even  the 
seasonal  moisture  changes  have  no  clear  influence.  There  is  a  very  striking 
difference   in   the   figures   obtained  for   nearly   all    the    organisms   at   the  beginning 
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Fortnightly  averages  of  total  numbers  of  Oicomonas,  sp.  y  and  sp.  a.,  and  of  bacteria,  moisture 

and  temperature. 

of  Julv,  1920,  from  those  for  the  end  of  June,  1921,  and  it  is  tempting  to  assume 
that  these  difierences  are  in  some  way  connected  with  the  fact  that  the  summer  of 
1920  was  very  wet,  whereas  1921  was  a  year  of  unprecedented  drought.  Contrary 
to  the  assumptions  quoted  above,  it  is  the  drought  which  is  in  most  cases  associated 
with  the  high  numbers.  When  the  curves  for  average  moisture  and  average 
numbers  are  compared  in  greater  detail,  however,  it  becomes  impossible  to  draw 
any  generalisations.     Thus  with  the  bacteria,  during  July  and   August,  when  the 
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soil  was  drying,  the  numbers  were  fairly  constant ;  they  rose  during  the  following 
months,  when  the  moisture  was  increasing,  but  fell  considerably  during  December, 
while  the  moisture  was  still  increasing.  Early  in  April  the  numbers  rose  sharply, 
while  the  moisture-content  was  also  rising,  but  during  May  and  June,  when  the 
greatest  rise  in  numbers  occurred,  the  soil  was  drying  rapidly.  If  the  curves  for  any 
of  the  protozoa  are  considered  in  the  same  way,  their  independence  of  the  moisture 
changes  is  apparent. 

(2)  Influence  of  Rainfall. 

It  has  been  suggested  (34)  that  a  heavy  shower  of  rain,  by  increasing  the  aeration 
of  the  soil  solution,  may  have  a  beneficial  effect  on  the  micro-organisms  living  in  it. 
Obviously,  rainfall  cannot  be  responsible  for  all  the  changes  in  numbers  that  were 
found,  for  they  occurred  even  when  no  rain  fell.  On  thirty  occasions  during  the 
year  the  rainfall  exceeded  0'2  inches,  and  the  changes  in  numbers  following  these 
heavy  rains  are  recorded  in  Table  I. 


Table  I. — Influence  of  Rainfall  of  0'2  inches  or  more  on  the  Soil  Population. 


Bacteria. 

Dimas- 
tiganwsba. 

Sp.a. 

Heteromita. 

Sp.y. 

Cerco- 
rtumas. 

Oicomonas. 

Total 
Protozoa. 

Numbers  rise    . 

19 

16 

9 

u 

7 

15 

14 

75 

„       fall     . 

10 

11 

18 

11 

19 

13 

15 

87 

No  change   .     . 

1 

3 

3 

5 

4 

2 

1 

18 

Ratio   '^''^'^^ 

cysts 

,,         rises    . 

— 

12 

9 

12 

16 

16 

10 

75 

falls     . 

— 

12 

18 

11 

12 

11 

18 

82 

No  change    .     . 

— 

2* 

3 

0* 

2 

1* 

1* 

9 

*  These  figures  are  incomplete,  as  it  is  impossible  to  calculate  the  ratio  *£JX?.  when  the  species  in 

cysts 

question  was  found  in  every  plate  of  the  series  (see  Appendix  I,  "  Experimental  Methods,"  p.  340). 

These  figures  do  not  indicate  any  significant  connection  between  rainfall  and  the 
daily  changes  in  number,  and  this  conclusion  is  borne  out  by  a  close  examination  of 
the  records.  Thus,  on  April  1 3,  when  the  first  heavy  shower  for  over  two  months 
occurred,  the  bacterial  numbers  fell  by  875  millions,  whereas  on  May  8,  after  three 
weeks  of  dry  weather,  a  heavy  shower  was  accompanied  by  a  rise  of  ll'l  millions.  It 
is  also  noteworthy  that  the  greater  part  of  the  autumnal  rise  in  numbers  occurred  at 
a  time  when  the  rainfall  was  low,  while  the  numbers  found  at  the  end  of  November, 
and  again  at  the  end  of  December  and  early  in  January  (figs.  1,  2),  do  not  seem  to  be 
influenced  by  the  heavy  rains  which  occurred  during  these  periods.  Conversely  large 
changes  in  the  average  numbers  sometimes  occurred  (as  at  the  end  of  March)  without 
any  corresponding  change  in  the  rainfall. 
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(3)  Influence  of  Soil  Temperature* 

It  is  apparent  from  Table  II,  section  7,  that  rises  and  falls  of  temperature  are 
associated  quite  at  random  with  the  daily  changes  in  numbers  of  Dimastigamceha  and 
Cercomonas.  A  similar  conclusion  applies  to  all  the  species  of  protozoa  recorded. 
With  the  bacteria,  the  same  result  was  established  by  determining  the  coefficient  of 
correlation  between  the  daily  changes  in  the  mean  soil  temperature  and  the  daily 
change  in  the  bacterial  numbers,  by  the  usual  formula  r  =  2  (x.  ?/./.  )/n.  o-j .  o-o.f 
The  value  found  was  r  =  — 0']49  ^  O'Ol,  which  is  too  small  to  indicate  any  connection 
between  the  two,  and  shows  conclusively  that  rising  temperature  does  not  cause 
increased  bacterial  numbers. 

A  similar  conclusion  follows  from  an  examination  of  the  curves  in  figs.  1,  2.  The 
seasonal  changes  in  the  numbers  of  the  various  organisms  show  no  connection  with 
the  seasonal  temperature  changes.  The  big  autumnal  rise  in  numbers  took  place 
while  the  temperature  was  steadily  falling,  and  one  organism,  sp.  a,  actually  reached 
its  maximum  at  the  coldest  period  of  the  year.  The  spring  rise  in  numbers,  on  the 
other  hand,  was  associated  with  steadily  rising  temperature.  The  unbroken  rise  in 
soil  temperature  between  the  middle  of  April  and  July  does  not  appear  to  have 
affected  the  piotozoal  numbers,  and  though  the  average  bacterial  numbers  rose  very 
much  during  the  period,  the  rise  was  broken  by  two  large  temporary  falls,  which  show 
that  the  increase  cannot  be  explained  simply  by  the  rising  temperature. 

During  the  fortnight,  December  6  to  19,  the  mean  soil  temperature  reached  a 
minimum,  owing  to  the  occurrence  of  several  days'  continuous  frost.  During  this 
period,  the  bacteria  and  all  the  protozoa,  except  sp.  a,  showed  a  fall  in  numbers. 
A  similar  spell  of  frost  occurred  in  February,  and  this  time  the  numbers  of  all 
the  protozoa  were  reduced,  but  the  bacteria  showed  a  slight  rise.  These  observa- 
tions are  not  very  striking  in  themselves,  but  are  of  some  interest  in  view  of  the 
conclusion  arrived  at  by  Kussell  and  Appleyard  (34),  that  the  amount  of  COj 
in  the  soil  air  is  independent  of  temperature,  except  when  the  latter  falls  below 
5°  C,  and  then  it  is  reduced.  It  is  probable,  therefore,  that  during  a  really  severe 
winter  the  number  of  soil  organisms  may  be  considerably  depressed,  but  owing  to 
the  mildness  of  the  winter  in  which  the  present  counts  were  made,  the  results 
obtained  are  not  conclusive  on  this  point.     Even  while  the  surface  of  the  soil  was 

*  The  mean  soil  temperature  has  been  reckoned  by  the  method  recommended  by  Keen  and 
Russell  (21),  in  which  the  average  is  found  of  the  maximum  air  temperature  and  of  the  minimum 
recorded  by  the  12-inch  soil  thermometer  during  the  24  hours.  The  readings  used  were  those  taken 
daily  at  10  a.m.  at  the  Rothamsted  meteorological  station,  situated  some  200  yards  from  the  plot;  tests 
showed  that  the  temperatures  recorded  at  the  two  places  were  identical. 

j  r  =  coefficient  of  correlation  ;  n  =  total  number  of  cases  ;  /  =  number  of  cases  in  class  ;  x  --=  deviation 
of  class  from  mean  temperature  change  ;  y  =  deviation  of  class  from  mean  change  in  bacterial  numbers  ; 
(Ti  =  standard  deviation  of  mean  temperature  changes ;  o-o  =  standard  deviation  of  changes  in  bacterial 
numbers. 
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frozen,  however,  the  numbers  of  all  the  organisms  showed  large  apparently  random 
daily  fluctuations  just  as  at  other  times.  It  was  only  when  averages  were  taken  that 
the  depressions  became  apparent. 


A.cta>e  ruinubers  of  Cerco/tzonxis  fronx  Jiib/  3^  tcAag  /6'^  ]920. 


(4)  Influence  of  Farm  Operations. 
An  increase  in  the  numbers  of  bacteria  in  a  field  soil  during  the  autumn  has  been 
observed  by  numerous  workers  in  the  past,  and  is  commonly  attributed  either  to  the 
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aerating  effect  of  autumn  ploughing,  or  to  the  nutritive  effect  of  manuring.  Neither 
of  these  explanations  apply  in  the  present  case,  for  none  of  the  farm  operations  (see 
Appendix  I)  were  followed  by  any  abnormal  changes  in  numbers,  and  no  manure  was 
applied  during  the  years  1920  and  1921. 

(5)  InjlHence  of  Chemical  Changes  in  the  Soil. 

In  preparing  the  soil  sample  each  day  for  counting  protozoal  cysts,  10  c.c.  of  the 
acid  was  always  withdrawn  just  before  the  dilutions  were  made,  and  was  titrated  with 
alkali  in  order  to  ascertain  that  its  concentration,  after  neutralising  the  carbonate  of 
the  soil,  was  still  sufficient  to  kill  all  the  active  jirotozoa.  Slight  differences  were 
found  from  day  to  day  In  the  amount  of  alkali  required  for  the  titration,  and  these 
must  correspond  to  differences  in  the  chemical  conditions  of  the  soil  samples,  but  no 
connection  could  be  found  between  them  and  the  changes  in  the  number  of  any  of  the 
organisms. 

(6)  Physiological  Influences — Autointoxication. 

A  population  of  500,000  individuals  of  any  species  of  protozoa  per  gramme  of  soil 
represents  about  2,500,000  pei-  c.c.  of  soil  solution,  for  20  per  cent,  may  be  taken  as  a 
typical  value  of  the  water  content  of  Rothamsted  soil.  If,  as  is  not  impossible,  the 
organisms  are  not  distributed  uniformly  throughout  this  medium,  but  are  aggregated 
at  some  particular  points  among  the  soil  particles,  their  density  may  be  sufficiently 
high  for  the  accumulation  of  their  waste  products,  in  the  absence  of  constant  renewal 
of  the  solution,  to  exert  a  depressing  effect  on  the  numbers  as  commonly  occurs  in 
artificial  cultures.  During  the  year,  it  was  sometimes  noticed  that  exceptionally  high 
numbers  of  any  organism  on  any  day  were  followed  by  low  numbers  the  next  day,  as 
if  autointoxication  had  taken  place.  But  an  examination  of  the  complete  figures  for 
the  whole  year  does  not  substantiate  this  hypothesis. 

Discussion. 

From  the  foregoing  sections  it  apjjears  that,  contrary  to  all  expectations,  no 
connection  can  be  traced  between  climatic  conditions  or  the  other  external  factors 
considered  and  either  the  daily  or  the  seasonal  changes  in  the  numbers  of  any  of  the 
soil  oi-ganisms  investigated.  It  is,  however,  difficult  to  resist  the  conclusion  that 
some  such  connection  must  exist.  The  similarity  between  the  seasonal  curves  for  all 
the  different  organisms  suggests  that  they  are  reacting  together  at  any  rate  to  the 
larger  climatic  changes.  There  is,  moreover,  small  but  significant  correlation  between 
the  daily  changes  of  some  of  the  protozoa.  Table  II,  section  2,  shows  that  there  is  a 
definite  excess  of  like  signs  when  the  daily  changes  in  any  two  flagellates  are  taken 
together,  and  also  when  Dimastigamceba  is  taken  with  either  Cercomonas  or 
Heteromita. 

The  similarity  that  was  observed  between  the  seasonal  changes  in  Diinastigamceba, 
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Cercomonas  and  Heteromita  therefore  extends  also  to  some  extent  to  the  daily- 
changes  both  in  their  cystic  and  active  numbers.  On  the  other  hand  although  the 
seasonal  curve  for  sp.  a  is  so  strikingly  parallel  to  those  for  sp.  y  and  Oicomonas 
the  daily  fluctuations  of  this  amoeba  are  quite  independent  of  the  other  two  species. 
The  absence  of  a  preponderance  of  unlike  signs,  however  (except  in  the  case  of  the 
amcebse  and  bacteria  which  will  be  considered  separately,  p.  :^3l),  shows  clearly  that 
there  is  no  antagonism  between  any  of  the  ditferent  organisms.  It  appears,  therefore, 
that,  though  all  the  organisms  respond  alike  to  seasonal  environmental  changes,  their 
responses  to  the  more  temporary  environmental  changes  are  much  less  uniform.  From 
the  available  data,  however,  it  is  impossible  to  say  what  is  the  actual  connection 
between  the  changes  in  numbers  of  any  of  the  soil  organisms  and  the  changes  in  their 
environment.  In  view  of  the  magnitude  of  the  changes  in  numbers  that  occur  from 
one  day  to  the  next,  it  is  obviously  desirable  to  obtain  some  knowledge  of  the  changes 
that  take  place  within  the  course  of  a  single  day  and  night ;  but  this  is  at  present 
impracticable,  and  probably  the  only  way  of  advance  at  present  is  through  an 
extensive  series  of  laboratory  experiments  under  rigidly  controlled  conditions. 

It  is  interesting  to  note  that  the  seasonal  variations  in  the  numbers  of  the  soil 
organisms  is  very  similar  to  those  which  have  frequently  been  recorded  in  the  case  of 
many  aquatic  organisms.  Miss  Delf(II),  for  instance,  found  that  in  the  ponds  at 
Hampstead  the  alg^  are  most  numerous  in  the  early  spring  (February  to  the 
beginning  of  May)  and  again  in  October  and  November,  some  species  disappearing 
entirely  during  the  intervals.  Somewhat  similar  seasonal  changes  have  been  recorded 
in  a  pond  by  Fritsch  and  EiCH  (15),  in  the  British  lakes  by  West  and  West  (35),  in 
the  Illinois  River  by  Kofoid  (22a),  and  elsewhere. 

In  the  sea  also  the  planktonic  diatoms  have  a  great  pulse  of  activity  between 
March  and  June,  and  the  dinoflagellates  attain  their  maximum  about  a  month  later. 
Both  groups  have  a  smaller  second  maximum  in  the  late  autumn  (19). 

Thus  the  maxima  for  the  soil  population  coincide  closely  with  those  of  the  mai'ine 
and  fresh-water  plankton,  except  that  whereas  the  aquatic  forms  are  most  active  in 
spring  and  only  show  a  minor  phase  of  activity  in  autumn,  the  reverse  appears  to  be 
true  in  the  soil.  It  is  difficult  to  resist  the  conclusion  that  the  annual  variations  are 
produced  by  similar  causes  in  each  case.  If  this  assumption  is  correct,  it  follows  that 
the  seasonal  changes  in  protozoal  numbers  in  the  soil  are  not  determined  solely  by  the 
number  of  bacteria  present,  in  spite  of  the  fact  that  bacteria  are  probably  the  only 
food  supply  of  the  species  in  question,  for  the  aquatic  algae  which  show  such  closely 
similar  seasonal  changes  are  not  dependent  on  such  a  food  supply.  This  conclusion  is 
supported  by  the  fact  that  the  number  of  all  the  protozoa  except  Oicomonas  rose 
during  March,  whereas  the  corresponding  increase  in  the  bacteria  was  delayed  till  the 
beginning  of  April. 

The  annual  fluctuations  in  aquatic  organisms  agree  further  with  those  in  the  soil 
organisms  in  having  little  if  any  close  relation  with   temperature.     A  marked  annual 
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periodicity  was  found  by  Apstein  (l)  even  in  a  lake  in  Ceylon,  where  the  total  range 
of  temperature  during  the  year  was  from  25*94°  C  in  November  to  28*28°  in  May. 
On  the  other  hand,  the  fresh-water  organisms  are  influenced  very  considerably  by  rain- 
fall and  general  hydrographic  conditions  (see  especially  Apstein  (1)  and  Kofoid  (22a)), 
whereas  no  such  influence  could  be  foiuid  in  the  soil.  Solar  energy  also  undoubtedly 
influences  the  development  of  the  phyto-plankton  (15),  and  Herdman  (19)  regards  it 
as  being  probably  the  chief  cause  of  the  great  increase  of  life  in  the  sea  in  the  spring. 
It  is  interesting  to  note  that  seasonal  changes  very  similar  to  those  found  in  the 
soil  bacteria  have  been  recorded  for  sewage  bacteria.  At  the  Dorking  sewage  works, 
Dr.  (now  Sir)  A.  C.  Houston  (31)  found  the  highest  numbers  during  October, 
November  and  December,  while  during  February,  March  and  April  the  numbers  were 
consistently  lower  than  at  any  other  time  of  the  year. 

Eelation  between  Protozoa  and  Bacteria. 

Comparison  of  the  numbers  of  bacteria  with  those  of  the  active  amoebae  [Dimastig- 
amosba  and  sp.  a)  shows  a  definite  inverse  relationship  between  them.  A  contingency 
table  (Table  II,  sec.  1 )  foi'med  from  these  numbers  shows  that  they  vary  inversely 
255*5  times  and  together  83*5  times,  with  a  probable  error  ±9*2  (figs.  4,  5,  and  G. 
Samples  320-327).  If  the  numbers  of  active  individuals  of  Dimastigamceba  and 
sp.  a  are  considered  separately  with  the  bacteria,  the  result  is  as  follows  : — 

Like.  Unlike. 

Dimastigamceba  and  bacteria    .     .     .      101  238  (70  per  cent,  imlike). 

Sp.  a  and  bacteria 166*5  193*5  (54  per  cent,  unlike). 

In  both  cases  the  difference  in  numbers  is  significant,  but  obviously  Dimastigamceba 
is  very  much  more  effective  in  its  action  on  bacteria  than  is  sp.  a  ;  but  in  fifty- 
three  cases  it  has  either  remained  unchanged  or  varied  with  bacteria,  while  sp.  a  has 
varied  inversely  with  them.  Such  cases  occur  at  Samples  34,  41,  49,  etc.  (fig.  6, 
Samples  162,  163).  An  examination  of  the  data  shows  that  generally,  where  the 
influence  of  sp.  a  on  the  bacterial  numbers  is  apparent,  the  numbers  oi  Dimas  tig  amoeba 
are  either  changing  very  slightly  (in  twenty  of  the  fifty-three  cases  they  have  not 
changed  at  all),  or  else  the  changes  in  sp.  a  are  large  {e.g.  Samples  137,  275). 

Out  of  the  339  days  considered  in  the  formation  of  tliese  contingency  tables, 
Dimastigamceba  shows  a  controlling  influence  on  the  bacteria  70  per  cent,  times,  and 
sp.  a  another  16  per  cent.  There  remain  then  14  per  cent,  or  forty-eight  days  on 
which  the  changes  in  numbers  in  the  bacteria  cannot  apparently  be  attributed  to  the 
action  of  the  amcebse,  since  they  vary  directly  and  not  inversely  with  each  other. 
Certain  of  these  can  be  explained  by  reference  to  the  amoebae  if  the  cysts  as  well  as 
the  active  forms  are  taken  into  consideration.  At  Sample  28  the  numbers  of  sp.  a 
are  practically  unchanged,  but  the  bacteria  and  Dimastigamceba  have  risen  ;  in  the 
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Numbers  of  active  amoebaB  (Bimastiganuxha  and  sp.  a)  and  bacteria  in  1  grm.  of  field  soil  for  typical 

periods  in  February  and  April. 

latter,  however,  the  rise  is  entirely  due  to  excystment,  so  that  the  influence  upon  the 
bacteria  would  naturally  have  been  slight,  especially  if  the  excystment  did  not  occur 
until  near  the  time  of  sampling.  At  Sample  35  sp.  a  is  again  unchanged,  the  bacteria 
fall,  and  in  Dimastigamoeha  the  active  numbers  fall  slightly,  though  the  totals  rise 
by  8000,  sho\ving  that  considerable  activity  has  gone  on  in  the  preceding  twenty- 
four  hours,  terminating  in  cyst  formation.  At  Sample  212  (fig.  6)  there  is  a  similar 
case ;    the  bacteria   and  active   DimastigamoehcB  fall,  and  though  sp.    a   rises,    the 
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Numbers  of  amoebiB  and  bacteria  per  gramme  of  soil  for  short  periods  in  December,  1920,  and 

May  and  February,  1921. 

increase  (1000)  is  scarcely  enough  to  account  for  the  bacterial  change,  but  there  has 
been  a  large  increase  in  the  cysts  of  Dimastigamoeba  (97,000).  At  Sample  297  there 
is  an  increase  in  both  active  amoebae  and  in  the  bacteria,  but  in  the  amoebae  the 
change  is  due  to  excystment,  as  in  Sample  28.  Sample  321  resembles  35,  but  in  this 
case  it  is  the  cysts  of  sp.  a.  which  have  increased  by  11,000.  At  Samples  362  and 
363  sp.  a  may  be  disregarded,  as  the  numbers  are  almost  stationary  ;  the  bacteria  fell 
on  both  days,  and  on  both  there  were  no  active  DimastigaTnceha  when  the  samples 
were  taken.  Again,  there  is  evidence  of  activity  during  the  time  between  the  counts, 
as  Sample  362  shows  a  fall  in  both  cysts  and  totals,  so  death  is  not  likely  to  have 
taken  place  until  excystment  and  consequent  activity  had  occurred,  while  Sample  363 
shows  a  rise  in  cysts  and  totals,  again  implying  that  there  has  been  an  intervening 
period  of  activity.  There  are  numerous  other  cases  where  the  same  thing  has 
occurred,  but  in  these  the  active  numbers  at  the  time  of  sampling  are  sufficient  to 
account  for  the  bacterial  changes. 

From  an  inspection  of  the  figures  it  is  not  possible  to  decide  whether  the  inter- 
action is  primarily  one  in  whicli  the  amoebae  influence  the  bacteria  or  vice  versd.  The 
possibility  of  the  bacterial  changes  initiating  those  in  the  amoebae  is,  however,  entirely 
ruled  out  by  the  fact  that  the  contingency  tables  show  no  positive  correlation 
between  the  changes  in  numbers  of  the  active  forms  in  the  two  amoebae. 
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Periodicity. 
Preliminary  experiments  (8)  had  shown  a  remarkable  periodic  fluctuation  in  the 
numbers  of  active  flagellates  in  the  soil,  high  numbers  on  one  day  being  succeeded  by 
low,  which  were  again  followed  by  high  on  the  third  day.  The  extended  observa- 
tions here  described  have  confirmed  this  phenomenon  and  have  shown  it  to  be 
characteristic  of  only  one  species  of  flagellates,  viz.,  Oicornonas  termo  (Ehren.).  The 
rhythm  has  been  maintained,  with  few  exceptions,  for  the  365  days  and  is  shown  in 
figs.  7  and  8  for  different  periods  of  the  year. 
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Daily  variations  in  the  numbers  of  active  individuals  of  a  species  of  soil  flagellate,  Oicornonas  termo 

(Ehrenb.),  during  July,  1920. 

It  can  be  shown  that  if  there  were  no  periodicity,  and  if  the  fluctuations  were 
purely  random,  then  the  number  of  like  signs  in  a  contingency  table  made  for  the 
active  numbers  on  successive  days  would  be  33^  per  cent,  and  the  number  of  unlike 
signs  66|  per  cent,  of  the  total  observations.  The  observed  figures  give  a  jaercentage 
of  only  15'08±r89  like  signs,  which  is  highly  significant  (see  Table  II,  sec.  5). 

The  possibility  was  considered  that  an  alternation  of  encystation  and  excystation 
might  partially  account  for  the  variation  of  the  active  forms ;  this,  however,  does  not 
hold,  for  the  cyst  curve  is  not  the  inverse  of  the  active  one.  Also,  if  the  hypothesis 
were  correct,  the  cysts  should  show  a  periodicity.  To  test  this  possibility  the 
numbers  of  cysts  on  successive  days  were  compared,  but  gave  33'43±2-49  per 
cent,  like  signs,  which  is  practically  the  expectation,  if  excystation  and  encystation 
are  wholly  non-periodic.     The  further  test  was  made  of  comparing  active  forms  of 
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one  day  with  the  cystic  of  the  next.  The  result  was  199  hke,  160  unlike  signs. 
Now,  if  there  is  no  connection  between  the  two,  that  is,  if  high  active  numbers  do 
not  tend  to  cause  cyst  formation,  or,  conversely,  the  expectation  would  be  50  per 
cent,   like    to    50    per    cent,    unlike  signs,   which   is  not   widely   different   from   the 
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Daily  variations  in  the  numbers  of  active  individuals  of  a  species  of  soil  flagellate,  Oicomonas  termu 

(Ehrenb.),  during  March,  1921. 
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numbers  obtained.  There  is,  however,  a  small  significant  excess  of  like  signs, 
brought  about  by  the  tendency  for  a  drop  in  active  numbers,  to  be  followed  by 
a  similar  drop  in  the  number  of  cysts  (see  Table  II,  sec.  5).  The  significance  of 
this  tendency  is,  however,  sufficiently  small  to  render  it  almost  negligible.  Thus  the 
hypothesis  of  regular  excystation  and  encystation  will  not  completely  account  for  the 
periodicity  of  the  active  forms.  As  would  be  expected  from  the  above  considerations, 
the  total  numbers  of  Oicomonas  are  periodic  to  a  small  extent.  Thus  the  comparison 
of  the  totals  for  successive  days  gives  27'33  per  cent.i2"35  of  like  signs,  which 
is  significant,  though  not  to  as  high  a  degree  as  that  of  the  active  forms.  Also, 
as  is  seen  in  Table  II,  sec.  5,  the  unlike  signs  predominate  when  the  actives  and 
totals  for  successive  days  are  compared. 

It  may  be  deduced,  therefore,  that  increased  active  numbers  tend  to  be  followed 
by  decreased  totals,  that  is  death,  conjugation,  or  both,  while  falls  in  the  active 
numbers  are  followed  by  rises  in  the  totals— reproduction. 

It  has  been  found  impossible  to  correlate  any  obvious  external  condition  with  the 
rhythm,  indeed,  it  is  difiicult  to  conceive  of  a  periodic  external  condition  such  as  would  be 
necessary;  and,  moreover,  pure  cultures  in  artificial  media  maintained  at  constant  tem- 
perature in  the  laboratory,  exhibit  a  precisely  similar  periodicity  of  active  numbers. 

It  is  concluded,  therefore,  that  the  solution  of  the  problem  must  be  sought  in  the 
organism  itself,  though  it  is  realised  that  environmental  conditions,  such  as  food 
supply,  will  play  a  part  in  the  complexity  of  factors  involved. 

Other  Species. 

The  active  numbers  of  D.  gruheri,  when  compared  for  successive  days,  also  show  a 
tendency  for  the  active  forms  to  alternate  from  day  to  day.  But  it  is  not  so  marked 
as  in  the  case  of  Oicomonas,  since  the  unlike  signs  are  only  76"7  per  cent.,  as  against 
the  expected  6 6  "6  per  cent.  The  other  species  show  no  tendency  to  periodicity  when 
tested  for  a  two  days'  period,  for  their  active  numbers  are  found  distributed  in  a 
random  way.  It  is  possible  that  a  rhythm  of  a  longer  interval — three,  four,  or  five  days 
— may  obtain,  but  this  can  be  tested  only  by  experiment  in  controlled  culture  media. 

In  no  case  is  there  evidence  that  cyst  formation  occurs  regularly  ;  in  every  species 
considered,  the  results  of  comparisons  between  actives  and  cystic,  or  between  the 
cystic  numbers  for  alternate  days,  are  what  would  be  expected  on  the  supposition  of 
random  excystation  and  encystation,  or  the  departure  from  expectation  is  too  small 
to  be  worth  comment.  Thus,  although  excystation  and  encystation  play  a  part  in 
the  variation  of  the  active  numbers,  the  preponderating  influence  is  death,  conjuga- 
tion and  reproduction,  causing  variation  in  the  total  numbers  of  the  species. 

Summary. 

1.  The  results  of  365  consecutive  daily  counts  of  the  numbers  of  bacteria  and  of 
six  species  of  protozoa  in  a  normal  field  soil  are  given,  and  the  methods  of  counting 
bacteria  and  protozoa  are  described. 
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2.  The  numbers  of  both  bacteria  and  protozoa  rarely  remain  the  same  from  one  day 
to  the  next.  The  fluctuations  are  very  great,  but  it  has  not  been  found  possible  to 
connect  them  with  meteorological  or  general  soil  conditions. 

3.  Fourteen-day  averages  of  the  daily  numbers  demonstrate  that  well-marked 
seasonal  changes  in  the  soil  population  are  superimposed  on  the  daily  variations  in 
numbers.  In  general,  both  bacteria  and  protozoa  are  most  numerous  at  the  end  of 
November  and  fewest  during  February.  These  changes  are  not  directly  influenced 
by  temperature  or  rainfall,  but  show  a  similarity  to  the  seasonal  fluctuations  recorded 
for  many  aquatic  organisms. 

4.  There  Is  a  slight  tendency  for  the  various  species  of  flagellates  to  fluctuate 
together  from  day  to  day,  but  this  is  not  shown  by  the  two  species  of  amoeba. 

5.  An  inverse  relationship  is  found  between  the  numbers  of  bacteria  and  active 
amoebae  in  86  per  cent,  of  the  total  observations. 

6.  A  two-day  periodicity  obtains  for  the  active  numbers  of  one  species  of  flagellate, 
Oicomonas  termo. 

APPENDIX   I. 

Experimental  Methods. 

Sampling  of  Soil. 

The  samples  of  soil  were  taken  from  the  plot  on  Barnfield,  Rothamsted,  which 
annually  receives  farmyard  manure  at  the  rate  of  14  tons  to  the  acre.  The  portion 
of  the  plot  chosen  was  apj^roximately  20  yards  long  by  12  yards  broad.  No  crop  was 
grown  during  the  period  of  the  experiment,  and  no  manure  added  in  the  year  1920  ; 
but,  in  July-August,  1920,  it  was  horse-hoed;  on  February  5-7,  1921,  ploughed;  on 
March  3,  cultivated  ;  on  April  15-16,  27-28,  rolled  ;  and  in  May,  June,  and  July,  hoed. 

The  samples  were  taken  at  the  same  time  (10  a.m.)  every  day,  the  method  being  as 
follows  : — 

Six  9-inch  borings,  taken  at  intervals  of  about  6  feet  in  a  line  across  the  plot,  and 
thoroughly  mixed  together  formed  Sample  1,  which  was  taken  to  the  laboratory  in  a 
sterile  tin. 

The  second  sample  was  taken  on  the  following  day  in  the  same  manner  and  on  the 
same  line  as  the  first,  except  that  the  borings  were  made  about  2  feet  away.  The  third 
sample  borings  were  made  about  2  feet  from  those  of  the  second  sample,  thus  com- 
pleting the  first  line  across  the  plot.  (See  Map,  p.  339.)  For  the  next  three  samples  a 
second  line  about  9  inches  from  the  first  was  used,  and  so  on  to  the  end  of  the  series. 
This  method  of  sampling  was  adopted  to  obviate,  as  far  as  possible,  errors  due  to 
unequal  distribution  of  the  micro-organisms  over  the  plot,  also  to  ensure  against  such 
an  error  control  experiments  were  made.  On  three  occasions  four  samples  were  taken 
simultaneously  across  the  plot,  and  counted.  The  results  for  the  protozoa  and  bacteria 
were  in  no  case  significantly  different  in  the  four  samples.  Further,  for  the  last 
fifteen  days  a  duplicate  sample  was  taken  6  yards  back,  and  the  bacteria  counted  ; 
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the  results  of  the  two  counts  were  concordant.  These  experiments  justify  the  belief 
that  unequal  distribution  in  the  field  cannot  account  for  the  daily  fluctuations  In 
number  of  the  organisms. 
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Map  showing  method  of  taking  soil  samples,  each  sample  being  formed  from  six  borings,  and  each 
boring  being  denoted  by  the  respective  numeral. 

Bacteria. 
The  nutrient  medium  employed  is  a  modification,  prepared  by  Mr.  H.  G.  Thornton 
and  as  yet  unpublished,  of  one  described  by  Conn.     It  consists  of: — 


Distilled  water  . 
KoHPO,  .  .  . 
KNO3.  .  .  . 
MgSO^  .  .  . 
CaCl,,  KCl,  NaCl 
Asparagine  . 
Agar  .... 
FeCL    .... 


1 

1      g 
0-5 

0-2 

0-1 

0-5 

15-0 

0-002 


litre. 


each. 


After  the  substances  have  melted  in  a  steamer  1  grm.  of  mannite  is  added,  and  the 
Ph  value  adjusted  to  7  "3. 

Since  this  medium  contains  only  simple  chemical  substances  it  is  not  liable  to  the 
variations  which  often  occur  in  successive  preparations  containing  peptone,  lemco,  or 
gelatine.  Such  variations  have  been  found  to  be  sources  of  considerable  error  in 
counting. 

The  bacteria  are  counted  by  a  dilution  method,  practicable  direct  means  being  yet 
unknown  ;  10  grm.  of  soil,  which  have  passed  through  a  3-mm.  sieve,  are  placed  in 
250  c.c.  of  sterile  physiological  salt  solution,  contained  in  a  750  c.c.  flask,  and  shaken 
for  5  minutes,  thus  giving  a  1/25  dilution.  1  c.c.  of  this  is  shaken  for  1  minute  in 
100  c.c.  of  saline  contained  in  a  300  c.c.  flask  ;  1  c.c.  of  this  second  dilution  is  shaken 
for  1  minute  in  a  100  c.c.  of  saHne  contained  in  a  300  c.c.  flask,  thus  giving  a  dilution 
of  1/250,000.  Of  this  1  c.c.  quantities  are  placed  in  the  bottom  of  five  separate 
sterile  Petri  dishes  ;  5  c.c.  of  the  nutritive  medium  are  added,  and  a  thorough 
admixture  ensured  by  rotating  the  dish.  After  incubation  at  20°  C.  for  7  days  the 
total  number  of  colonies  are  counted,  and  the  numbers  of  bacteria  in  millions  per 
gramme  calculated. 

Strict  adherence  to  the  details  of  the  technique  is  necessary,  otherwise  the  results 
are  not  reliable. 
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The  data  obtained  have  been  examined  statistically  by  Mr.  R.  A.  Fisher,  and  in 
nearly  80  per  cent,  of  the  365  daily  counts  of  bacterial  numbers,  the  distribution  of 
variation  in  the  numbers  of  colonies  on  each  plate  is  strictly  that  vrhich  should  be 
obtained  by  random  sampling  as  calculated  by  the  Poisson  series.  This  series 
represents  the  mathematical  consequences  of  an  accurate  technique  of  sampling 
and  dilution.  That  the  10  per  cent,  of  exceptions  (marked  with  an  asterisk  in 
Table  IV)  are  not  extreme  cases  of  systematic  errors  affecting  the  whole  of  the 
data,  is  shown  by  the  sharpness  with  which  they  may  be  distinguished  from  the 
rest  by  the  statistical  tests,  and  by  the  conformity  of  the  90  per  cent,  to  the 
theoretical  distribution.  For  these  latter  counts  the  probable  error  would  therefore 
be  approximately  2/3  of  the  standard  error,  which  depends  solely  on  the  total 
number  of  colonies  counted.  Thus,  if  the  total  number  of  colonies  counted  on  a 
series  of  plates  is  291,  this  figure  is  affected  by  a  standard  error  of  \/291  =  17'1, 
or  by  a  probable  error  17'l-j-2/3  =  11  "4.  Since  this  example  was  obtained  from  a 
four-plate  series,  the  result  in  millions  per  gramme  is  obtained  by  dividing  by  16 
giving  18-2  ±071. 

The  probable  error  varies  with  each  count  made,  but  for  the  whole  series  (excluding 
the  exceptions)  it  is  between  4-6  per  cent. 

Protozoa. 

The  protozoa  are  counted  by  a  dilution  method,  the  essentials  of  which  have  already 
been  described  (7).  In  the  present  experiment  the  three  ranges  of  dilutions  employed 
at  different  times  of  the  year  (see  Table  IV)  were  1/25-1/102,400,  1/50-1/204,800 
and  1/50-1/409,600.  It  has  been  shown  (6,  7)  that  a  more  complete  census  of  the 
soil  population  is  obtained  if  the  Petri  dishes,  inoculated  from  each  dilution,  are 
incubated  for  twenty-eight  days  and  examined  weekly,  since  a  few  protozoa,  e.g. 
Chlamydophrys,  develop  only  slowly  ;  but  it  was  impracticable  to  continue  observa- 
tions for  so  long  owing  to  the  great  labour  involved.  The  counts,  therefore,  refer 
only  to  those  species  which  reproduce  sufficiently  rapidly  to  give  accurate  results  after 
fourteen  days'  incubation.  Even  with  this  restriction  over  100  plates  were  examined 
daily,  and  there  were  always  about  700  plates  in  the  incubator  room. 
The  numbers  of  organisms  in  the  soil  sample  are  deduced  as  follows  : — 
For  a  given  soil  population  the  average  number  of  negative  plates  in  a  series  (that 
is  plates  showing  no  protozoan  development)  can  be  calculated,  conseqiiently  any 
observed  number  of  negative  plates  will  represent  a  certain  density  of  population  in 
the  soil  sample  investigated.  Such  calculations  have  been  done  by  Mr.  II.  A.  Fisher 
who  compiled  Table  III  for  us.  Thus  in  a  series  of  dilutions  ranging  irom 
1/25-1/102,400  five  negative  plates  will  represent  an  average  population  of  16.000 
per  gramme  of  soil.  The  probable  error  of  observations  made  in  this  way  can  best  be 
regarded  as  one  of  plate  difference,  a  two-plate  difference  not  being  significant,  thus  a 
four-negative  plate  result  is  not  significantly  different  from  a  two-  or  six-negative 
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plate  result.  Such  a  probable  error  holds  so  long  as  the  number  of  plates  falls  in  the 
intermediate  portion  of  the  series,  at  each  extreme  end  (one  or  two  negative  plates  or 
twenty-four  or  twenty-five  negative  plates)  the  error  becomes  considerable  and 
difficult  to  calculate.  Fortunately  in  our  experiments  the  number  of  organisms  with 
the  exception  of  Heteromita  sp.  has  been  such  as  to  give  negative  plates  within  the 
range  of  accuracy. 

The  accuracy  of  the  method  has  been  rigorously  tested  by  a  series  of  experiments 
given  in  a  earlier  paper  (6),  and  the  results  obtained  with  Oicomonas  termo  (see  p.  335) 
afford  further  proof  of  the  efficiency  of  the  technique.  This  animal  shows  a  two-day 
periodicity  of  the  active  forms  which  can  hardly  be  ascribed  to  experimental  error. 

The  number  of  negative  plates  are  given  in  Table  IV  and  the  actual  figures  may  be 
obtained  by  reference  to  Table  III. 


Table  III. — Tables  showing  the  Number  of  Soil  Organisms  per  gramme  of  Soil 
corresponding  to  the  Number  of  Negative  Plates  in  the  various  Dilution 
Series  used. 


1. 

2. 

3. 

Dilutions  25-102,400. 

Dilutions 

50-204,800 
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No.  of 

Protozoa 

No.  of 

Protozoa 

No.  of 

Protozoa 

sterile  plates. 

per  gramme. 

sterile  plates. 

per  gramme. 

sterile  plates. 

per  gramme. 

1 

110,000 

1 

220,000 

1 

420,000 

2 

59,000 
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118,000 
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230,000 

3 

36,000 
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72,000 
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4 

23,000 
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46,000 
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6 
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220 
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1.  General  Problem  of  Evaluating  the  Effects  of  Weather  on  Crops. 

At  the  present  time  very  little  can  be  claimed  to  be  known  as  to  the  effects  of  weather 
upon  farm  crops.  The  obscmnty  of  the  subject,  in  spite  of  its  immense  importance  to 
a  great  national  industry,  may  be  ascribed  pai-tly  to  the  inherent  complexity  of  the 
problem  which  it  presents,  aijd  more  especially  to  the  lack  of  quantitative  data  obtainetl 
either  under  experimental  or  under  industrial  conditions,  by  the  study  of  which  accurate 
knowledge  alone  can  be  acquired.  Of  the  industrial  applications  of  such  knowledge  it 
is  unnecessary  to  speak  here  in  detail.  It  is  sufficient  to  indicate  that  the  present  system 
of  Life  Insurance,  which  safeguards  the  economic  stability  of  many  thousands  of  families, 
and  occujjies  the  activities  of  many  of  the  greatest  financial  corporations,  was  niade 
jjossible  by  the  stucUes  of  statistics  of  human  mortality  by  the  mathematicians  of  the 
eighteenth  and  nineteenth  centuries,  and  that  on  the  basis  of  adequate  knowledge  similar 
economic  stability  with  its  attendant  security  of  capital  should  be  within  the  reach  of  the 
industrial  farmer. 
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The  inherent  complexity  of  the  relationships  which  it  is  sought  to  elucidate,  between 
the  yields  of  farm  crops,  and  the  previous  weather  which  largely  controls  those  yields, 
arises  primarily  from  the  complexity  of  the  problem  of  specifying  the  weather  itself. 
Meteorologists  have,  however,  gradually  devised  a  number  of  instrumental  observations, 
which  although  far  from  specifying  the  total  environment  of  the  growing  plant,  as 
understood  by  the  plant  physiologist,  do  nevertheless  give  a  sufficiently  detailed  accoimt 
of  the  general  environmental  conditions  of  the  growing  crop,  in  so  far  as  these  vary  from 
season  to  season.  It  is  probable  indeed  that  almost  all  the  weather  influences  to  which 
crop  variations  are  due  could  be  exj^ressed  in  terms  of  the  instrumental  observations 
of  a  modern  meteorological  station.  The  actual  difficulty  of  calculating  the  crop  varia- 
tions from  given  instrumental  records  is,  however,  immense  ;  only  an  attack  of  the  most 
preliminary  kind  upon  the  general  problem  can  be  attempted  in  this  paper.  The  complete 
aim  of  agricultural  meteorology  should,  however,  be  emphasised,  for  it  is  only  by  its 
substantial  achievement  that  other  causes  of  crop  variation  can  be  freed  from  much 
obscurity. 

A  valuable  example  of  an  investigation  in  which  this  aim  was  clearly  held  in  view  is 
to  be  found  in  A.  Walter's  study  in  1910  of  the  effects  of  weather  upon  the  Sugar  crop 
in  Mauritius  (17).  Walter  was  able  to  give  a  very  complete  account  of  the  meteorological 
causes  affecting  the  yield  of  sugar,  to  such  an  extent  that  this  author  considers  that  under 
miiform  estate  management  and  methods  of  cultivation  the  yield  may  be  predicted  from 
meteorological  data  only  with  a  standard  error  of  the  order  of  1  per  cent.  The  number  of 
harvests  available  for  this  enquiry  seems,  however,  to  have  been  only  13,  and  more 
recent  investigations  in  statistical  theory  make  it  now  possible  to  reahse,  more  clearly 
than  was  at  the  time  possible,  some  of  the  dangers  which  arise  from  paucity  of  crop  data, 

2.  Effects  of  jxiucity  of  Crop  Data. 
The  biometrical  investigations  for  which  the  method  of  multiple  correlation  was 
developed,  differ  from  such  agricultural  studies  as  the  present  in  two  main  particulars. 
In  the  first  place  the  number  of  individuals  measured  was  taken  to  be  large,  of  the  order 
of  1 .000.  or  at  least  of  some  himdreds,  and  the  special  problems  of  distribution  which 
arise  in  small  samples  have  only  more  recently  begun  to  receive  attention.  In  the  second 
place  the  number  of  ineasurements  taken  of  each  individual,  or  the  number  of  variates, 
was  generally  small,  and  in  all  cases  fax  smaller  than  the  number  of  meteorological 
elements  which  may  plausibly  be  regarded  as  affecting  the  crop.  The  sequence  of  wealher 
to  u'liicli  rr..p  viuiatiojis  may  be  ascribed  extends  over  a  year,  or  even  more,  and  consists 
in  tliis  c.uutry  of  a  series  of  abrupt,  relatively  violent  and  transient  spells,  each  of  which 
has  its  inHueiice  on  the  crop.  If  we  wished  to  analyse  the  sequence  no  more  closely 
than  by  monthly  averages,  we  should  still  have  12  values  for  rainfall,  and  12  more  for 
maximum  and  minimum  temperature,  dew  point,  grass  minimum,  solar  maximum  and 
soil  temperatures,  nor  would  it  be  unreasonable  to  include  some  such  measure  of  insola- 
tion as  is  given  by  "  Hours  of  Bright  Sunshine,"  and  averages  for  the  direction  and  forcQ 
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of  the  wind.  The  number  of  meteorological  elements  might  be  made  to  exceed  even 
the  longest  series  of  crop  records  available,  for  the  Rothamsted  wheat  records  provide, 
after  necessary  deductions,  only  60  yields.  Consequently,  if  the  computer  were  pro- 
vided not  with  yield  data  at  all,  but  with  an  equal  number  of  values  composed  at  random, 
he  would  still  be  able  to  express  them  with  perfect  accuracy  in  terms  of  the  weather 
records,  for  the  number  of  unknowns  available  would  exceed  the  number  of  equations 
for  them  to  satisfy. 

It  is  of  more  practical  importance  that  even  when  we  have  selected  a  niimber  of 
meteorological  variables  which  is  less  than  the  number  of  crops  recorded,  a  strong 
semblance  of  dependence  may  be  produced,  even  when  fictitious  data,  unrelated  to  the 
weather,  are  substituted  for  the  true  ciops.  For  let  there  be  «.  values  of  the  dependent 
variate  y  and  n  each  of  the  q  meteorological  variates  x\,  x.^,  ...  ,  x^.  If  now  we  obtain 
a  partial  regression  equation 

showing  the  apparent  dependence  of  y  upon  ajp  . . . ,  x^,  then  the  correlation  between 
y  and  Y,  namely  R,  will  be  the  multiple  correlation  of  y  with  x^,  ...  ,  x^,  and  will  be 
greater  than  the  correlation  of  y  with  any  other  linear  function  oi  x^  ,  ...  ,  x^.  Since 
R  is  necessarily  positive  (or  zero),  as  was  noticed  by  Hooker  (10,  p.  7),  there  will 
be  generally  an  appearance  of  correlation  produced  in  this  way,  even  if  no  such 
relation  really  exists.  This  effect  becomes  particularly  marked  if  the  number  in  the 
sample  is  small. 

When  there  is  no  real  correlation  it  is  possible  to  determine  a  priori  the  distribution 
of  the  multiple  correlation  of  any  quantity  y  with  any  q  variates  x^,  x^,  ... ,  x^. 
There  are  many  ways  of  approaching  the  solution,  of  which  perhaps  the  most 
enlightening  is  to  regard  the  deviations  y—i)  as  components  determining  the  direction 
of  a  radius  vector  OP  in  n  dimensions.  Then  q  other  directions  are  specified  by  the  q 
meteorological  variates,  and  any  linear  regression  formula  is  represented  by  some  Une 
through  the  origin  in  the  space  of  q  dimensions,  specified  by  these  q  directions.  The 
multiple  correlation,  R,  is  then  the  cosine  of  the  angle  which  the  line  OP  makes  with 
the  space  of  q  dimensions.  If  the  variate  y  is  unrelated  to  the  variates  x^,  ...  ,  x,^,  the 
line  OP  may  be  regarded  as  drawn  at  random  through  0,  in  the  space  of  n  dimensions  ; 
usino'  this  fact  it  may  be  shown  without  difficulty  that  the  chance  that  R-  falls  into 
the  elementary  range  dR^  is 

n —  3  j 


It  will  be  noted  that  the  distribiition  is  independent  of  any  correlations  which  may 
exist  between  the  meteorological  variates,  provided  no  one  of  them  can  be  exactly 

o2 
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calculated  from  the  others.  In  the  case  q  =  I,  when  only  one  independent  variate  is 
used,  the  formula  reduces  to 

df= ■ (1  -R')  -  f?R. 

which  is  known  (Fisher  (4),  1915)  to  be  the  distribution  of  the  correlation  (taken 
positive)  derived  from  a  sample  of  n,  between  two  uncorrelated  variates.      It  may 
also  be  seen  that  when  </  =  «  —  1,  R  is  necessarily  equal  to  -\-  1. 
The  mean  value  of  R-  is 

n  —  I 

and  since 

S(Y-,y)^  =  R^S(?/~//)^ 

we  may  say  that  in  the  absence  of  correlation,  the  variance  of  y  (the  square  of  its  standard 
deviation)  will  be  distributed  on  the  average  equally  between  the  q  degrees  of  freedom 
of  the  regression  formula,  and  the  {n  —  q  —  I)  degrees  of  freedom  in  which  y  departs 
from  the  regression  formula.  This  view  of  the  matter  enables  us  to  follow  the  above 
conclusions  to  some  extent  into  the  more  general  case  in  which  y  is  really  in  some 
degree  coiTelated  with  ;r„  ...,  x^.  For  if  y  be  imagined  as  made  up  of  two  parts, 
Vo  +  y\  such  that  ?/„  is  wholly  determined  as  a  linear  function  by  the  independent 
variates  and  y'  is  wholly  uncorrelated  with  them,  then  y^  and  y'  must  be  mutually  un- 
correlated, so  that  each  will  contribute  a  certain  j^ercentage  to  the  variance  of  y.  Let 
the  fraction  contributed  by  y^  be  A,  then  the  regression  formula  found  by  correlating 
a  sample  of  y'  with  the  independent  variates  will  differ  from  that  found  by  correlating 
y,  merely  by  y^  which  is  by  hypothesis  a  linear  function  oixi,  ...,  x^.  Consequently  the 
average  proportion  of  the  variance  in  the  regression  formula  for  y  will  be 

R^=A  +  ^-f^(l-A), 
or 

1  -  1^  =  (1  _  A)  "'  ~  ^  ~  ^ 

If  the  sample  be  increased  indefinitely  (%->  oo)  then  the  limiting  value  of  R^  is  A. 
An  estimate  of  A  from  a  finite  sample,  free  from  the  positive  bias  of  R^.  is  obtained 
by  taking 

1-Ai  =-ii^li— (1-R2) (II) 

A  complete  dismission  of  the  errors  of  random  sampling  in  multiple  regression  requires 
a  knowledge  of  the  frequency  distribution  of  R,  or  of  Aj,  calculated  by  equation  (II), 
in  the  general  case  when  A  is  not  zero  ;   equation  I  gives  this  distribution  only  when 
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A  =  0.  It  provides  a  means  of  testing  if  an  observed  value  of  R  differs  significantly 
from  zero,  but  not  for  testing  in  general,  whether  one  value  of  R^  is  significantly  greater 
tlian  another. 

From  equation  (I)  it  ajjpears  that  the  probability  that  R,  obtained  from  a  random 
sample  of  n  observations,  should  exceed  any  specified  value,  is,  if  q  is  even, 


P  =  fl  -^R2\=o.-?-i)  Ji  _i  "'  —  9  —  1  P2   I   {n--q—l){n~q  +  l) 


2.4 


R^ 


and,  if  q  is  odd, 

w  —  y  —  2  ^ 

P=    2  2 

^■K  H  —  y  —  3 , 
2 


2.4... (./-2)  '' 


dz 


(i+zT"- 


/  it- 

n-7-2, 


•\/7T  n  — 


R  (1  -  R^) 


Hn-q-l)  j  I 


4-'J—lx,. 


-R2 


I         {n-q)(n~q-^2)...{n~5)  p„_,\ 
3.5...(<z-2)  ]• 

The  analogy  of  these  formulae  with  those  giving  the  probability,  P,  that  //,  the 
Pearsonian  test  of  goodness  of  fit,  should  exceed  any  specified  value  is  obvious. 
It  will  be  noticed  that  in  the  second  formula  the  probability  integral  of  the  normal 
curve  is  replaced  by  that  of  the  Type  VII  curves  which  has  been  tabulated  by 
"  Student  "  (16,  1917).  When  q  is  small,  very  few  of  the  terms  of  the  series  are 
involved,  thus  for  g-  =  6,  or  7,  the  series  terminates  with  the  term  in  R'. 

The  effect  of  increasing  q,  the  number  of  variates  used,  is  to  increase  somewhat  rapidly 
the  probability  that  R  should  exceed  any  specified  value,  even  for  an  independent 
variate  wholly  uncorrelated  Avith  those  used  to  predict  it  ;  this  increase  may  be  exem- 
plified in  the  following  table,  where  we  have  taken  n  =  13,  q  =  4,  0.  8  and  calculated 
the  chance  of  R  exceeding  0-5,  0-6,  0-7,  0-8  and  0-9. 


Table  I.— ?i  =  13. 


9 

R=  0-5 

0-6 

0-7 

0-8 

0-9 

4 
6 
8 

0-6328 
0-8965 
0-9844 

0-4094 
0-7491 
0-9402 

0-2002 
0-5187 
0-8248 

0-0598 
0-2509 
0-590G 

0-0055 
0-0505 
0-2424 
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It  will  be  seen  that  the  chance  of  obtaining  a  value  R  >  0  •  &  is  only  one  in  1 80  for 
(/  =  4  ;  it  rises  to  one  in  twenty  for  g'  =  6,  and  about  one  in  foiu-  for  q  =  8.  For  q  —  S, 
therefore,  an  observed  multiijle  correlation  0  •  9  cannot  be  regarded  as  significant,  that 
is  as  convincing  evidence  that  the  "  crop  "  is  in  any  way  influenced  by  the  meteorological 
variates.  The  only  value,  in  fact,  in  the  above  table  which  could  be  regarded  as  definitely 
significant  is  the  value  0*9  for  q  =  4,  while  ior  q  =  Q  this  is  suggestive  only  of  real 
influence. 

A  still  more  insidious  source  of  illusory  high  correlations  lies  in  the  fact  that  the 
particular  variates  chosen  for  correlation  with  the  crop  figures,  are  often  chosen  because 
they  appear  in  fact  to  be  associated  with  the  crop.  It  is  a  common  practice,  as  a  pre- 
liminary to  the  study  of  weather  correlations,  to  search  for  the  so-called  critical  periods 
by  such  methods  as  the  following.  The  years  are  arranged  in  order  of  crop  yield,  and 
a  number  of  meteorological  values  are  plotted  on  a  chart  in  this  order  ;  those  meteoro- 
logical values  which  show  an  apparent  trend  upwards  or  downwards,  are  then  picked 
out,  and  used  to  construct  a  weather  formula  by  which  the  crop  may  be  predicted. 
Such  methods  are  especially  deceptive  when  the  series  of  years  is  shorts  (about  20  or  30), 
for  if  such  a  process  were  carried  out  thoroughly  we  shoidd  not  have  merely  the  random 
sampling  distribution  of  R,  which  as  we  have  seen  is  capable  of  yielding  sufficiently 
high  values  from  uncorrelated  material,  but  we  should  be  choosing  that  set  of  q  variates, 
out  of  a  larger  number  p,  which  gave  the  highest  value  of  R. 

The  effect  of  such  a  process,  if  dummy  data  were  substituted  for  the  actual  crop 
records,  could  be  accurately  foretold  by  the  solution  of  the  following  problem.  A  line 
OP  is  drawn  through  a  point  0,  at  random  in  ii  dimensions,  p  directions  (where  p  may 
exceed  n)  are  also  chosen  at  random,  and  of  them  q  (</<«)  are  selected  so  that  the  space 
of  q  dimensions  through  these  makes  the  least  possible  angle,  0,  with  OP.  Find  the 
sampling  distribution  of  0  (or,  of  R  =  cos  0). 

I  can  put  forward  no  general  solution  of  this  problem  ;  the  case  p  =  q  has  been 
solved  above.  The  additional  advantage  conferred  by  a  choice  of  variates  may  be  clearly 
seen  from  the  case  (/  =  1 .     For 


u 

2 

n  —  [ 
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n  — 
■      2 
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(1  -  R^)  -^  fZR 


represents  the  chance  that  for  a  single  uncorrelated  variate  chosen  at  random,  the 
correlation  in  a  sample  shall  not  exceed  R  ;  hence  it  follows  that  the  chance  that  for 
none  of  p  uncorrelated  variates  chosen  at  random,  does  the  correlation  exceed  R, 
must  be 


•"    2.?i^! 


v.  ' 


—  (1-R'')~(ZR 

4  1 


.-0  ^   ■•  •      2 
and  this  is  the  chance  that  the  highest  correlation  of  j;  shall  jiot  exceed  R. 
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If  a  standard  of  significance  be  chosen  such  that  it  will  be  exceeded  in  random  samples 
once  in  20  times,  the  effect  of  having  a  choice  of  p  variates  will  increase  the 
chance  of  exceeding  any  assigned  rare  value  nearly  ji  times.  Thus  for  «  =  13,  0*6 
will  not  be  judged  significant,  even  for  a  single  variate  correlated  if  p  exceed  2,  0-7 
ceases  to  be  significant  Up  >  1,  and  0 •  8  when  p  exceeds  about  54.  When  we  consider 
that  p  may  be  very  great,  since  not  only  is  the  mimber  of  meteorological  elements  avail- 
able for  selection  large,  but  also  since  many  Writers  alloW  themselves  to  use  complicated 
functions  of  the  instrument;)!  data,  involving  adjustable  weights,  special  conventions 
of  sign,  and  allowances  for  periods  judged  to  be  unusual  in  their  effects,  so  that  these 
artificial  meteorological  data  may  be  calculated  in  an  enormous  variety  of  ways,  it  is 
clear  that  the  conclusions  we  have  drawn  as  to  the  dangers  of  applying  multiple  corre- 
lation formulai  to  small  samples,  are  very  much  to  be  emphasised  when  a  choice  is  made 
as  to  what  meteorological  elements  to  correlate. 

In  view  of  the  foregoing  facts  it  would  seem  worth  while  to  lay  down  the  following 
conditions  for  arriving  at  unprejudiced  results  : — 

(i)  The  meteorological  variates  to  be  employed  nmst  be  chosen  without  reference 
to  the  actual  crop  record. 

(ii)  If  multiple  variates  are  to  be  used  allowance  must  be  made  for  the  positive 
bias  of  Rl 

(iii)  Ivelationships  of  a  complicated  character  should  be  sought  only  when  long 
series  of  crop  data  are  available. 


3.  The  A)uilysis  of  the  Season. 

By  the  Season  is  meant  the  whole  sequence  of  weather  directly  or  indirectly  influencing 
the  crop  from  its  inception  to  the  time  the  produce  is  weighed.  In  studying  the  influence 
of  the  rain  on  the  wheat  crop,  we  have  chosen  a  period  of  a  year  ending  on  the  31st  of 
August  of  the  year  in  which  the  crop  was  harvested.  To  obviate  the  irregidarity  of  the 
calendar,  this  year  was  taken  to  be  of  366  days,  so  that  it  commenced  on  either  the  31st 
of  August  or  the  1st  of  September  prior  to  the  sowing  of  the  seed.  Taking  into  con- 
sideration a  single  element,  namely  Rain,  only,  this  sequence  is  in  our  climate  sufficiently 
complex.  The  general  distribution  of  rain  through  the  year  could,  indeed,  be  roughly 
represented  by  dividing  the  period  up  into  say  6  sections  and  recording  the  total  rainfall 
in  each  period.  If  the  rainfall  were  approximately  evenly  distributed  in  successive  days 
or  weeks,  this  method  might  indeed  give  a  fairly  accurate  picture  of  the  sequence  of 
rainfall.  But  jiotoriously  the  rain  is  in  fact  often  concentrated  in  short  spells,  with 
rainless  periods  intervening,  consequently  if  March  and  April  formed  one  section,  a 
great  part  of  the  rain  ascribed  to  that  section  might  fafl  early  in  March  in  one  year,  and 
late  in  April  in  another,  and  it  would  be  impossible  to  regard  such  falls  as  equivalent 
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in  their  effects  on  the  crop.  Whereas  a  late  April  fall  might  well  be  nearly  equivalent 
to  one  occurring  early  in  May. 

This  consideration  suggests  that  a  finer  subdivision  of  the  year  is  necessary  ;  that 
months  or  even  weeks  should  be  treated  separately.  From  the  agricidtural  and  meteoro- 
logical point  of  view  there  is  everything  to  be  said  for  such  subdivision  ;  it  raises,  how- 
ever, new  mathematical  difficulties  ;  for  whereas  the  evaluation  of  deterniinants  of  6  rows 
and  columns,  or  the  solution  of  simultaneous  linear  equations  in  6  unknowns  is  sufficiently 
rapid,  disproportionate  labour  is  involved  in  increasing  the  6  to  12,  and  if  52  unknowns 
were  attempted  the  labour  involved  would  become  fantastic. 

A  consideration  which  has  been  still  more  influential  in  framing  our  method  is  that 
even  if  we  were  to  calculate  the  52  partial  regression  coefficients  showing  the  average 
effect  in  bushels  per  acre  of  an  extra  inch  of  rainfall,  for  each  week  of  the  year,  such  a 
calculation  would  leave  out  of  consideration  the  all-important  fact  that  this  effect 
may  be  expected  to  change  continuously  during  the  year.  The  true  partial  regression 
coefficients  for  neighbouring  periods  must  be  relatively  alike.  The  method  of  partial 
correlation  as  hitherto  developed,  takes  no  account  of  the  serial  character  of  our  weather 
variates  ;  after  calculating  the  partial  regression  coefficients,  we  should  still  be  far  from 
the  facts,  if  we  did  not  smooth  the  series  so  obtained  by  a  continuous  curve,  which 
should  average  out  the  independent  errors  in  the  values  obtained  for  the  successive 
weeks. 

Disregarding,  then,  both  the  arithmetical  and  the  statistical  difficulties,  which  a  direct 
attack  on  the  problem  Would  encounter,  we  may  recognise  that  whereas  wth  q  subdi- 
visions of  the  year,  the  linear  regression  equations  of  the  wheat  crop  upon  the  rainfall 
would  be  of  the  form 

w  =  c  +  a{i\  +  a,r.,  -f  . . .  -f  a/, 

where  r,,  r^,  ...,  y,  are  the  quantities  of  rain  in  the  several  intervals  of  time,  and 
a„  ...  a,  are  the  regression  coefficients,  so  if  infinitely  small  subdivisions  of  time  were 
taken,  we  should  rejilace  the  linear  regression  function  by  a  regression  integral  of  the 
form 

iv  =  c-^\  ardt, (Ill) 

w'licfc  r  (It  is  the  raiji  falling  in  the  element  of  time  (//  ;  the  integral  is  taken  over  the  whole 
period  concerned,  and  a  is  a  continuo%is  function  of  the  time  t,  which  it  is  our  object  to 
evaluate  from  the  statistical  data. 

It  will  be  seen  that  by  ijrocecdiiig  to  the  limit  all  artificiality  has  been  eliminated  from 
the  quantity  «,  which  now  represents  an  objective  ])hysical  quantity,  namely  the  average 
benefit  to  the  crop  in  bushels  per  acre  per  ijuli  of  rain,  falling  in  the  time-element  con- 
sidered. This,  of  course,  is  more  than  even  a  daily  record  of  rain  can  tell  us,  but  owing 
to  the  relatively  slow  changes  in  the  functions  a,  we  shall  find  it  sufficient  to  divide  the 
3(5(5  day  year  into  61  equal  joeriods  of  6  days  each. 
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It  should  be  noted  that  corresponding  to  the  quadratic  terms  of  a  regression  formula 
the  indejjendent  variates  of  which  form  a  continuous  series,  i.e.,  to 

s(Va). 

we  have  the  double  regression  integral 


Jo  Jo 


where  6  is  a  continuous  function  of  the  two  epochs  t  and  (',  just  as  a  is  a  continuous 
f miction  of  t  only. 

The  concept  of  the  regression  integral,  involving  a  regression  function  varying  con- 
tinuously with  the  time,  is  not  only  of  service  in  displaying  in  a  precise  mathematical 
form  the  nature  of  the  relationship  which  is  to  be  investigated,  but  when  the  functions 
a  and  b  vary  with  the  time  relatively  slowly,  it  suggests  a  statistical  procedure  by  which 
values  of  these  functions  may  be  obtained  from  the  data.  That  the  values  must  change 
relatively  slowly  is  apparent  from  the  fact  that  the  state  of  advancement  of  the  crop, 
as  indicated  either  by  the  time  of  harvest,  or  by  phenological  observations  on  common 
weeds,  often  varies  relatively  to  the  Calendar  date  by  as  much  as  a  fortnight. 

If,  then,  To,  Tj,  Tj  ...  be  a  series  of  orthogonal  functions  of  the  time,  such  that 

Jo 

we  may  express  the  rate  of  rainfall  at  any  epoch,  in  the  form 

*■  =  PoTo  +  P/i\  +  P2T2  +  •  •  •  » 

where 

also  we  may  express  the  regression  function  in  the  form 

a  =  aX  +  «J,  +  a,T,+  ..., (IV) 

where 

y.,  =  ^aT,dt, 

noting  in  the  latter  case  that  relatively  few  terms  of  the  expansion  will  be  required,  if  a 
is  a  slowly-varying  quantity. 

VOL.  CCXIII. — B-  P 
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Then  we  shall  have  from  equation  (III) 

=  c  +      ardt  =  c-\-  a„Po  +  ajPj  +  a,?,  +  . . .  . 

J  (I 


w 


Now  the    values   of    p„.    p,.    pa may    be    obtained    for   each  year    from  the  rain 

record,  and  by  correlating  them  with  the  crop  data,  we  can  obtain  the  values  of 
a„,  aj.  a,,  ...  ,  as  partial  regression  coefficients  of  the  crop  on  the  coefficients  of  the  rainfall 
distribution  ;  then  from  (IV)  we  can  evaluate  the  expansion  of  a  to  as  many  terms  as 
we  have  obtained.  It  is  easy  to  verify  that  in  the  same  manner  the  double  regression 
integral  may  be  expressed  as  a  quadratic  regression  formula  for  the  crop  in  terms  of  the 
rainfall  distribution  values. 

The  advantage  of  this  indirect  method  of  attack  lies  not  only  in  the  factfe  that  for  a 
fine  division  of  the  year,  the  arithmetical  difficulties  of  a  direct  attack  are  insurmountable, 
and  that  no  existing  crop  record  is  sufficiently  long  to  justify  the  calculation  from  it  of 
50  or  60  independent  regression  coefficients,  for  the  probable  error  of  such  determinations 
depends  on  the  excess  of  the  number  of  observations  over  that  of  the  coefficients  evaluated 
(9),  but  also  in  the  increased  accuracy  attained  through  utilising  our  knowledge  that  the 
regression  function  must  in  fact  vary  relatively  slowly.  This  advantage  is  analogous  to 
that  obtained  by  the  author  in  collaboration  with  Miss  Mackenzie,  who  found  that  using 
the  correlations  of  weekly  rainfall  between  different  stations  in  Great  Britain  from  a 
record  of  about  40  years,  it  is  possible  to  estimate  the  correlation,  for  any  one  week 
of  the  year,  with  an  accuracy  which,  for  any  single  week,  would  have  required  records 
for  over  1,000  years  ;  this  was  due  to  the  fact  that  the  sequence  of  correlations  could  be 
well  represented  by  a  harmonic  curve  of  only  the  second  order,  so  that  use  could  be  made 
of  some  2,000  actual  simultaneous  observations  of  weekly  rainfall  to  eliminate  errors  of 
random  sampling  (8.  Fisher  and  Mackenzie,  1922).  In  the  present  investigation  the 
accuracy  is  limited  by  the  number  of  crops  recorded,  but  the  method  employed  enables 
full  use  to  be  made  of  the  meteorological  data. 

In  the  practical  application  of  the  method  of  the  regression  integral,  it  is  not  necessary 
that  the  rainfall  record  should  be  strictly  continuous  ;  indeed  daily  observations  are 
themselves  too  numerous  to  be  conveniently  handled.  The  rain  was  therefore  divided 
up  into  61  periods  of  6  days  each,  the  integrations  being  replaced  by  summations  over 
61  terms.  This  introduces  a  slight  modification  into  the  form  of  the  orthogonal  functions 
of  the  time,  for  it  is  necessary  for  exact  work  that 

S(T;r.)         r^, 

1 

should  vanish.  The  most  convenient  orthogonal  functions  to  use  are  those  developed 
by  EsscHER  (3)  in  respect  to  mortality,  and  independently  by  the  present  author,  in 
eliminating  the  slow  changes  observable  in  the  wheat  yields  of  the  experimental  plots 
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(5,  1921).     If  the  time  is  measured  from  the  middle  point  of  a  series  of  n  terms,  in 
units  equal  to  the  interval  between  successive  terms,  we  have 

T  =1 


l)(n^-4) 


(<—«.), 


2,800 


l)(n'-4)(n='-  9)\ 


n., 


44,100 /^4  _   n^  —  n.Jn^  ,  .  n^n^  —  w^' 

,'  —  1)  ('W=^  —  4)  (rr  —  9)  (n'  —  16)  V         «i  —  n/  ^4  —  n/ 


698,544 


tf—  1 )  {iv  —  4)  («-  —  9)  (/r  —  1 6)  {71?  —  25)  \  w^^b  —  ^^i" 


n,  —  «/ 


n^^e  —  n^ 


where  n,,  '«i,  ^^d  ^^^  '^s  stand  for  the  mean  values  of  f ,  i*,  f  and  f .     So  that 

^^2  =  tV(«''-  1)' 


«, 


=  ^L(3n'-7), 


«^'=a„(»-l)  (»'-«. 


^^^ H  =  1 «  ('*"  -  7), 


TJr,8(15«*-230n^  +  407). 


These  functions  are  in  reality  very  convenient  to  handle,  the  practical  arithmetic  involved 
will  be  explained  in  Section  (5)  under  the  analysis  of  the  rain  data.  If  n  be  increased 
the  functions  tend  to  take  the  form  of  Legendre  Polynomials. 


4.  The  Correlation  of  Residuals. 

When  it  is  desired  to  study  the  correlation  of  variables  such  as  annual  figures,  in  which 
progressive  changes  are  observable,  it  is  necessary  in  order  to  obtain  results  which  can 
be  relied  upon,  and  compared,  in  the  same  manner  as  those  obtained  from  homogeneous 
material,  to  eliminate  in  some  way  the  influence  of  the  progressive  changes.  Material 
of  this  kind  is  very  abundant  and  of  the  utmost  importance  ;  the  bulk  of  official  statistics, 

p  2 
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vital,  economic,  epidemiological,  meteorological  and  agricultural,  may  be  said  to  be 
awaiting  a  method  of  reduction  which  shall  deal  adequately  with  the  difficulties  which 
this  type  of  data  presents.  My  investigations  have  not  led  me  to  any  satisfactory  or 
complete  solution  of  the  problem,  but  since  systematic  methods  have  been  developed 
and  strongly  advocated,  it  would  seem  worth  while  at  the  present  time  to  put  forward 
a  group  of  mutually  connected  considerations,  which  slied  new  light  upon  various  aspects 
of  the  problem,  and  bring  clearly  into  view  the  sources  of  error  to  which  the  more  obvious 
methods  of  approach  are  exposed. 

It  has  been  observed  (4,  Fisher,  1915)  that  if  the  individual  values  of  a  sample  be 
taken  as  the  co-ordinates  of  a  point  in  generaUsed  space,  the  mean  standard  deviation  and 
coefficient  of  correlation  of  a  sample  are  capable  of  a  very  beautiful  geometrical  inter- 
pretation.     For  if  we  assign  any  value  Jc  as  the  mean  of  n  observations  x^,  x^,  ... ,  x^, 

the  equation 

S  (x)  =  nx 

represents  a  plane  section  of  the  distribution,  placed  at  right  angles  to  the  Una 

ajj  =  a^g  =^  ...  x^ 

and  meeting  it  at  the  point 

X-^  ^  x<i^=  ...  ^  a;„  =  X. 

If  this  point  is  taken  as  origin,  we  have 

x  ^  0, 
and  the  point  representing  any  sample  has  rectangular  co-ordinates 

3^1  X^  X^         X^  •  •  •  J  '*'»  **') 

so  that  its  distance  from  the  new  origin  is  the  square  root  of 

«tx2  =  S  (a;  -  xf 

where  ju.  is  the  standard  deviation  of  the  sample. 

The  mean  of  the  observations  thus  specifies  the  origin,  and  the  standard  deviation 
specifies  the  length  of  the  radius  vector  from  the  origin  to  the  sample  point ;  the  corre- 
lational properties  of  the  sample  depend  on  the  direction  of  this  radius  vector.  For  if  a 
corresponding  construction  be  made  for  the  values  of  a  second  variate 

2/i,  2/.,  •••  ,  2/„ 

and  each  of  the  axes  of  the  ^/-space  be  brought  into  coincidence  with  the  corresponding 
axes  of  the  x-space,  it  has  been  shown,  and  it  is  easy  to  see,  that  the  correlation  coefficient 
between  x  and  y  is  the  cosine  of  the  angle  between  the  two  rad/ii  vectores. 

The  geometrical  interpretation  of  the  correlation  coefficient  provides  the  simplest 
possible  proof  of  a  theorem  of  some  importance  in  the  present  connection,  namely,  that : 
//  x/,  x^,  ...  xj  he  the  co-ordinates  of  the  first  sample-point  with  respect  to  any  rectilinear 


ON   THE   YIELD  OF   WHEAT  AT   ROTHAM.STED.  101 

orthogonal  axes  through  the  same  origin  and  y/,  y./,  ...  y„'  the  corresponding  functions  of 
VvVi  ■■■  Vn  then  the  correlation  coefficient  between  a;'  and  y'  is  equal  to  the  correlation  coefficient 
between  x  and  y. 

The  system  of  orthogonal  functions  of  the  chief  practical  importance  are  those  which 
arise  in  the  ehmination  of  progressive  changes  by  means  of  a  polynomial  function  of  the 
time.  These  are  the  same  functions  as  are  used  in  analysing  the  sequence  of  weather 
in  each  particular  season  ;  it  is  worth  noting  that  the  above  theorem  is  true  of  any 
orthogonal  system. 

In  an  unchanging  series  of  independent  values,  i.e.,  one  in  which  the  frequency  of 
occurrence  of  any  value  is  independent  of  the  time,  all  the  quantities  x',  except  cc/,  will 
be  distributed  in  symmetrical  curves  about  the  mean  value  zero.  If  the  values  of  x 
are  normally  distributed  about  the  mean  ni,  with  standard  deviation  a,  then  all  the 
values  of  x'  will  be  normally  distributed  with  standard  deviation  a,  x/,  about  a  mean 
m\/n,  and  the  remainder  about  zero.  This  follows  at  once  from  the  fact  that  the  distri- 
bution of  the  points  representing  the  sample  will  be  a  globular  normal  cluster,  and  conse- 
quently the  distribution  of  their  projections  on  a  line  drawn  in  any  direction  ■will  be 
invariable.  Even  when  the  values  of  x  are  not  normally  distributed,  those  of  a/  will 
in  general  be  much  more  nearly  so,  owing  to  the  properties  of  compound  distributions 
which  have  often  been  discussed. 

The  simplest  type  of  changing  series  is  one  in  which  the  several  values  are  distributed 
independently  and  in  similar  distributions,  but  about  a  changing  mean.  It  is  with  a 
view  to  eliminating  slow  changes  in  the  mean,  that  most  methods  of  obtaining  residuals 
from  smoothed  values  are  employed.  It  is  usually  the  case  that  slow  changes  in  the 
mean  value  may  be  sufficiently  represented  by  a  polynomial  whose  degree  is  small 
compared  to  the  number  of  observations  in  the  series.  In  such  cases  the  first  few  values 
of  x'  will  be  abnormally  gi-eat  (positive  or  negative),  while  the  remainder  will  be  normally 
and  independently  distributed.  The  whole  group  of  values  of  x'  is  heterogeneous,  and 
the  correlation  of  two  such  series  will  have  no  easy  interpretation  ;  its  value  may  be 
governed  by  the  values  of  a  few  exceptional  terms,  and  therefore  its  probable  error  will 
be  excessive.  If,  however,  we  reject  the  exceptional  terms  and  calculate  the  correlation 
from  the  normally  distributed  remainder,  we  shall  be  correlating  homogeneous  material 
and  the  coefficient  obtained  will  have  a  simple  interpretation  and  its  usual  jjrecision. 
Thus  if  the  first  p  values  of  x'  suffice  to  specify  the  slow  changes  in  the  mean  value,  the 
correlation  required  is  that  obtained  by  ignoring  the  first  p  values  of  x',  and  y\  or  in  other 
words  putting  them  equal  to  zero. 

This  process  is  only  an  extension  of  the  universal  practice  of  eliminating  the  mean 
values  of  the  variates  in  calcidating  the  correlation  coefficient  ;  in  working  with  an 
unchanging  series  we  habitually  ignore  the  values  of  a/,  and  y\.  In  the  more  general 
case  we  take 

{n  -  p)  ^;^  =  S  {x'%         {n  -  p)  ^t/  --=  S  {y%         {n  -  p)  v-^v-X  =  S  {x'y'). 
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From  another  point  of  view  this  may  be  regarded  as  the  result  of  finding  the  partial 
correlation  between  x  and  y  when  the  first  {f  —  \)  powers  of  t,  regarded  as  a  system 
of  correlated  variates,  are  eliminated. 

The  value  of  r  obtained  in  this  way  is,  as  we  have  shown,  identical  with  that  obtained 
from  the  mean  squares  and  mean  products  of  the  residuals  of  the  two  series  after  fitting 
polynomials  of  degree  {p  —  1).  The  values  of  such  residuals  are  indeed  heterogeneous 
in  respect  of  variability,  and  mutually  connected  by  p  equations  ;  but  since  the  values 
of  x'  (and  equally  of  y')  are  independent  and  equally  variable,  the  value  of  r  obtained 
from  such  residuals  will  be  distributed  in  random  samples,  exactly  as  is  that  obtained 
from  an  ordinary  correlation  oi  n  —  j?  +  1  independent  homogeneous  values.  The 
frequency  distribution  of  r  in  such  cases  has  been  treated  in  detail  (6,  Fisher, 
'  Metron.'  1921). 

The  elimination  from  data  of  obvious  heterogeneity  is,  of  course,  a  different  process 
from  a  process  which  leaves  the  material  necessarily  homogeneous.  The  latter  process 
is  jjerhaps  unobtainable,  but  it  would  be  desirable,  in  cases  where  complications  are 
expected,  to  have  a  means  of  testing  whether  the  heterogeneity  has  been  sufficiently 
eliminated.  The  elimination  of  the  slowest  changes  is  easy,  since  the  earliest  terms  of 
the  polynomial  are  quickly  calculated.  The  size  of  the  coefficients  affords  a  criterion 
of  rejection  ;  thus  for  a  record  of  the  dressed  grain  from  two  plots  on  Broadbalk  for 
67  years  I  obtained  (5,  1921). 


Dunged  plot  2b. 

Plot  11  (no  potash). 

Degree. 

x'lix. 

P. 

Degree. 

x'jii. 

P. 

I 
2 
3 
4 
5 

-0-82 
-2-38 
-2-50 
-2-06 
■4-4-02 

0-41 

0-017 
0-012 
0-039 
0-000058 

1 
2 

3 

4 
5 

-4-78 
+0-51 
+0-20 
-0-96 
+1-72 

(l-OIKHHIlS 

0-61 
0-84 
0-34 
0-086 

This  first  column  shows  the  degree  of  the  polynomial  term  ;  the  second,  the  value  of  x' 
in  terms  of  the  standard  deviation  as  estimated  from  the  residuals  after  a  curve  of  the  5th 
degree  had  been  eliminated  ;  the  third  column  gives  the  probability  of  such  a  term 
occurring  by  chance. 

Since  the  standard  deviation  is  derived  effectively  from  the  sum  of  61  independent 
squares,  the  exact  form  of  the  distribution  of  x/fx,  which  is  Student's  Type  VII  curve, 
for  n  —  62,  is  taken  to  be  equivalent  to  the  normal  distribution.  On  the  dunged  plot 
the  first  term  gives  no  grounds  for  elimination  ;  no  significant  deterioration  having 
occurred  on  this  plot,  and  it  is  immaterial  whether  or  not  such  a  term  were  retained  in 
the  correlations.     The  remaining  four  terms  all  show  the  existence  of  a  slow  change 
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in  the  yield.  These  four  terms  contribute  to  the  variance  over  half  as  much  as  the  re- 
maining 61  terms  put  together.  It  should  be  noted  that  while  little  can  be  lost  in  reject- 
ing the  first  term  from  the  correlation,  nothing  can  be  gained  by  retaining  it,  for  though 
its  smallness  renders  it  innocuous,  its  claim  to  rank  as  part  of  the  homogeneous  material 
is  overturned  by  the  size  of  the  following  terms.  On  plot  11  the  deterioration  due  to 
exhaustion  of  potash  is  very  great,  and  no  other  term  except  perhaps  the  5th  is  large 
enough  to  excite  suspicion.  Our  grounds  for  eliminating  the  first'  five  terms  in  this 
case  are  that  all  the  plots  show  similar  and  almost  proportional  slow  changes,  and  that 
this  is  doubtless  the  case  with  Plot  11,  where,  however,  the  great  variability  of  the 
annual  yields  has  masked  the  significance  of  the  coefficients. 

Certainty  as  to  the  homogeneity  of  the  remaining  values  of  x'  could.  I  think,  only  be 
obtained  by  a  complete  calculation  of  their  values  ;  in  the  absence  of  tables  of  the  co- 
efficients this  would  be  very  laborious.  The  whole  set  should  be  normally  distributed 
about  zero,  and  heterogeneity  would  be  indicated  by  Pa  significantly  exceeding  its 
normal  value,  3.  In  .the  neighbourhood  of  the  normal  curve  Pa  ^^^  been  shown  to  be 
the  most  efficient  statistic  for  such  a  test  (7,  1922)  ;  a  method  of  calcidating  Pa  for  the 
x'  distribution  from  the  actual  values  of  x  would  therefore  be  a  valuable  resource, 
especially  if  any  way  could  be  found  of  calculating  it  without  the  separate  evaluation 
of  the  values  of  x' .  It  is  believed  that  little  heterogeneity  remains  in  the  series  of  wheat 
yields  from  Broadbalk  after  fitting  polynomials  up  to  the  5th  degree. 

The  occuiTence  of  series  of  values  exhibiting  slow  changes  is  so  widespread  that  it 
is  not  surprising  that  a  number  of  different  methods  have  been  prepared  for  dealing  with 
the  difficulty,  and  that  some  difference  of  opinion  should  exist  as  to  the  disadvantages  of 
each.  We  shall  show  that  all  the  methods  of  elucidation  which  have  been  at  all  fully 
developed  fall  into  one  class,  and  that  the  main  drawback  of  this  class  of  methods  is 
reduced  to  a  minimum  by  the  use  of  polynomials  fitted  to  the  whole  series. 

Slow  changes  may  be  eliminated  in  several  ways  : — 

(i)  by  fitting  a  polynomial  or  other  ciu've  to  the  whole  of  the  data  ; 
(ii)  by  the  use  of  "  smooth  "  values  obtained  by  compounding  a  number  of  neigh- 
bouring terms  ; 
(iii)  by  repeatedly  differencing  the  observed  series. 

In  all  these  cases  the  resulting  residuals  designed  for  use  as  correlation  variates 
may  be  successfidly  freed  of  any  slow  progressive  trend  which  vitiates  the  original  series ; 
but  in  all  cases  this  is  achieved  by  entangling  together  to  some  extent  the  successive 
values.  This  effect  is  inevitable,  for  we  must  judge  of  the  smooth  value  from  which  our 
residual  is  measured  by  the  values  at  neighbouring  epochs. 

It  might  appear  that  simple  differencing  should  be  placed  in  a  separate  class,  since 
here  no  smooth  value  objectively  appears.  There  is  no  essential  difference  however  ; 
for  if  2r  is  any  even  integer,  we  can  construct  a  function, 
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which  is  in  effect  the  smooth  value,  the  deviations  from  which  are  proportional  to  the 
{2ry^  differences  ;  to  make  v^  a  minimum  for  a  series  of  equally  variable  quantities  in 
random  order  we  require 

j^_    {2rjY_ 


Thus  the  smooth  value  corresponding  to  6th  difference  is 

%  =  Mo  +  ^f  T  SX 

=  ^Jx  {l3lMo  +  75  (<*,  +  M_,)  —  30  {u.,  +  u_.,)  -I-  5  {u^  +  m.j)}  . 

The  formula  so  obtained  is  identical  with  that  given  by  Sheppard  (15,  p.  31),  for  7  point 
smoothing,  using  a  polynomial  of  the  5th  degree.  The  variate  difference  method  is 
therefore  only  an  extreme  form  of  the  use  of  Sheppard's  smoothing  formula — ^the 
extreme  in  which  the  number  of  terms  is  a  minimum  for  given  degree  of  the  slow  change 
eliminated — while  the  other  extreme  is  represented  by  the  process  of  fitting  a  poly- 
nomial of  the  required  degree  to  the  whole  of  the  series. 

In  treating  these  three  processes  as  special  forms  of  Sheppard's  smoothing  process 
one  distinction  must  be  made.  In  fitting  a  polynomial  to  the  whole  of  the  series  we  wish 
to  use  not  only  the  residual  of  the  middle  term,  but  the  whole  series  of  residuals.  So 
in  using  smooth  values  from  (say)  a  15  point  formula,  the  first  and  last  seven  residuals 
may  be  obtained  from  the  curves  fitted  to  the  first  and  last  sets  of  15  points.  In  the 
applications  which  have  been  made  of  the  Variate  Difference  Method,  only  the  residuals 
of  the  middle  terms  have  been  used.  The  number  in  the  series  has  been  diminished 
by  one  with  each  differencing  ;  if  however  we  wish  to  add  the  missing  terms,  this  is 
easily  done  by  means  of  binomial  coefficients  ;  for  example  if 

a,  b,  G 

are  the  sixth  differences  of  a  series,  the  three  missing  residuals  prior  to  a  are  as  follows 


a 

,     6 

15 

H a, 

—  —a, 

a,  h,  c. 

liU 

20    ' 

20    ' 

The  effects  of  such  processes  in  entangling  the  neighbouring  terms  may  best  be  .seen 
by  ('onsidering  the  effect  of  applying  them  to  an  unchanging  series  of  equally  variable 
quantities.  If  u  stand  for  such  a  quantity  then,  for  example,  the  sixth  difference  may 
be  written 

V  =  -  20u,  +  15  («,  +  «_,)  -  6  {u,  +  «_,)  -f  {u,  +  u_,), 

hence  evidently 

v"  =  H-  924  ??, 


VW  =  +  495  u', 
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and  so  on,  the  numerical  coefficients  being  those  of  the  expansion  (1  +  xY^.     Conse- 
quently the  correlations  between  neighboimng  values  of  v  will  be 


r,  =  - 

-f 

= 

—  0-8571, 

^2  =^  S  • 

= 

+  0-5357, 

r,  =  - 

.  4 

r. 

(i  

T  — 

—  U-2381, 

r,= 

= 

+  0-0714. 

n  = 

-0-0130, 

n  = 

+  0-0011 

If  such  high  correlations  as  these  are  produced  in  an  originally  uncorrected  series,  it  is 
clear  that  they  cannot  be  used  without  drastic  correction  in  an  examination  into  such 
correlations  between  neighbouring  terms  as  exist  in  the  original  series.  Equally  large 
are  the  efltects  when  two  separate  series  are  correlated.  If  we  take,  for  example,  a  second 
series  u',  and  obtain 

v'  =  8"tf', 

then  if  the  original  two  series  were  correlated  so  that 


it  is  easy  to  see  that  the  correlations  obtained  from  the  6th  differences  will  be  expressible 
in  terms  of  the  p  series,  in  the  form 

By  such  a  method  therefore  we  shall  not  obtain  the  true  values  p^,  but  quantities 
proportional  to  the  12th  difference  of  the  series.  Only  if  all  values  of  p,  except  one, 
vanish,  and  we  correlate  the  corresponding  values  of  u  and  u',  will  we  obtain  an  estimate 
of  the  correlation  unaffected  by  gross  inaccuracy.  This  constitutes  a  fatal  objection  to 
the  applications  which  have  been  made  of  the  variate  difference  method  in  its  original 
form. 

The  same  som'ce  of  error  still  persists  in  more  moderate  degree  when  smoothing  formula; 
involving  more  terms  are  used.  Let  us  take  for  example  Sheppard's  formula  for  fitting 
a  polynomial  of  the  5th  degree  to  sets  of  15  points.     The  smoothed  middle  point  is 

(1  -  ^¥  S«  +  B  S«  +  n  S'»  +  n  S12  +  -if^  81^)  u  ; 
whence  we  have  the  residual 

v=    — ^-— — {35126iio— 10125(Wi  +  M_,)  — 7500(w,  +  M_3)  — 3755(m3  +  m_3) 

li.io.i/.iy 

+  165  (Mi  +  it_  J  — 2937  K  +  i<_5)  +  2860(«e  +  «-.)—2145(M7 +  «_;)}. 
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The  correlations  between  neighbouring  terms  of  an  originally  uncorrelated  series  now 

become 

r,=^  -0-3075  '-8=  -  0-0132 

r,=  -  0-2370  r,=  +  0-0072 

^3=   -0-1119  r,,=  +  0-0158 

r.  =  +  0-0355  •                      »■„  =  +  0-0099 

r,  =  +  0-1440  ?-,2  =  -  0-0027 

r,3=  +0-1211  »-i3=  -  0-0076 

r,=   -  0-1565  r,,=  +  0-0028. 

The  correlations  are  now  much  more  moderate.  Their  alterations  are  less  violent,  and 
since  their  sum  is  compelled  to  be  —  0-5,  their  actual  values  are  permitted  to  be  con- 
siderably smaller.  Such  values  are,  however,  sufficiently  large  to  show  that  great 
inaccuracy  will  be  introduced  if  we  ascertain  the  mutual  correlations  of  members  of  a 
series  from  those  of  the  residuals  from  smooth  values  obtained  from  15  adjacent  points. 
It  is  evident  also  that,  apart  from  the  question  of  the  correlation  of  successive  values, 
the  correlations  between  two  series  will  be  vitiated  in  the  same  manner  as  by  the  variate 
difference  method,  though  to  a  less  degree  ;  the  correlations  obtained  may  be  expressed 
in  terms  of  high  differences  of  the  true  correlations,  for  if  we  write 

i>  (s^)  =  'ii  8"  +  i-i  S'  +  M  s^"  +  f?  S^'  +  ^  S", 

and 

then  we  have 

The  general  problem  of  eliminating  the  cross  correlations  from  a  pair  of  series  showing 
slow  changes  is  extremely  complex ;  in  the  simplest  case  each  value  of  the  one  series 
is  correlated  with  only  two  adjacent  values  of  the  second  series.  To  this  case  probably 
belongs  the  relation  of  wheat  crop  to  weather,  for  the  crop  is  admittedly  much  afiected 
by  the  weather  in  the  harvest  year,  and  may  to  a  much  less  extent  be  influenced,  through 
the  condition  of  the  seed,  or  of  the  soil,  by  the  weather  of  the  previous  year.  It  is 
fortimate,  therefore,  that  one  of  the  examples,  upon  which  the  largest  amount  of  work 
has  been  expended,  is  probably  also  of  the  same  type,  and  will  serve  to  indicate  the  order 
of  magnitude  of  the  errors  to  be  expected,  through  neglecting  the  cross  correlation  of  the 
crop  with  the  weather  of  the  previous  year. 

The  two  series  of  infantile  death  rates  in  the  first  and  second  years  of  life,  are  probably 
connected  principally,  if  not  wholly,  by  the  fact  that  the  mortality,  in  any  one  year,  of 
children  in  the  second  year  of  life,  refers  to  nearly  the  same  group  of  children  as  the 
mortality  in  the  previous  year  of  children  in  the  first  year  of  life  ;  and  by  the  second  fact 
that  mortality  in  the  two  age  groups  during  the  same  year  will  be  conditioned  by  the 
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same  meteorological  and  epidemiological  conditions.  If  the  first  effect  is  the  main  object 
of  study,  the  latter  ^vill  appear  as  a  cross  correlation  introducing  errors  into  our  estimate 
of  the  first  effect,  according  to  the  method  of  estimation  employed. 

In  1915  (1)  Elderton  and  Pearson  fomid,  using  the  variate  difference  method, 
the  value  —  0  •  688  for  the  correlation  between  the  mortalities  of  the  same  group  of  childi'en 
(males)  in  the  two  years.  In  1923  the  same  avithors  using  Shepp arc's  15-point  smoothing 
formula  find  the  value  — 0-463.  The  discrepancy  is  very  great  and  suggests  that  the 
neglect  of  the  correlation  between  the  mortalities  of  the  two  groups  of  children  in  the  same 
year  has  produced  a  large  negative  bias,  which  is  more  pronounced  in  the  earlier  estimate. 
Fitting  a  polynomial  to  the  whole  series  further  diminishes,  though  it  does  not  eliminate, 
the  error  ;  for  the  polynomials  of  the  4th,  5th  and  0th  degrees  we  find  the  values 
-0-308,  -  0-311,  -0-377. 

Now,  assuming  that  only  two  correlations  are  really  operative,  these  values  may  be 
corrected  by  calculating  in  a  similar  manner  the  apparent  correlations  in  mortality  for 
the  two  groups  of  children  in  the  same  year.  These  values  are  +  0-4706,  +  0-5456  and 
+  0-5217  ;  taldng,  then,  the  correlation  between  successive  residuals  of  a  series  of  n 
terms  fitted  by  a  polynomial  of  degree  r  to  be  —  (/•  +  l)/(«  —  1),  we  have  the  equations 

r       Po -0-0877p,  = -0-3083 
4th  degree-^  P,  =  -0-2691; 

L  -  0-0877po  +  pi  =  + 0-4706 

r       p,  -  0-1053pi  = -0-3109 
5th  degree  \  p  =  -  0  •  2563  ; 

L  -0-1053p„  +  p,  =  +0-5456 

r        p„-0-1228p,  = -0-3768 
6th  degree^  p„=- 0-3175. 

1  -0-1228p,  +  p,  =  +0-5217 

The  values  of  p,,  obtained  from  these  equations  are  unbiassed  estimates  of  the  corre- 
lation required  ;  they  agree  in  indicating  a  correlation  about  —  0-3.  This  value  may  be 
confirmed  from  the  figm'es  given  by  Pearson  and  Elderton  for  their  15-point  smooth 
curve,  which  lead  to  the  equations 

Po  -  0-3075p,  =  -0-4548 

p,  =  -0-2808. 
-  0-3075  Po  +  Pi  =  + 0-6521 

The  concordance  of  these  results  indicates  that  whereas  the  variate  difference  method 
has  exaggerated  the  value  of  this  correlation  to  the  extent  of  more  than  doubhng  it, 
the  correlation  of  residuals  from  the  15-point  smooth  curve  has  reduced  the  error  to  about 
50  per  cent.,  and  is  moreover  capable  of  correction  provided  only  a  few  important  corre- 
lations are  present.  The  method  of  polynomial  fitting  has  introduced  errors  of  about 
20  per  cent,  only,  and  the  correction  applied  to  it  may  be  expected  to  be  for  this  reason 
all  the  more  precise. 

In  calculating  the  effects  of  weather  upon  the  crop,  it  is  probable,  therefore,  that  the 

Q2 


108  MR.  E.  A.  FISHER  ON   THE  INFLUENCE  OF  RAINFALL 

effects  of  weather  previous  to  the  harvest  year  considered  may  be  ignored.  Hooker 
(10,  1907)  has  indeed  indicated  that  some  real  effect  may  be  ascribable  to  previous 
weather,  but  his  correlations  in  the  case  of  wheat  are  low,  and  as  we  have  seen,  if  the 
polynomials  fitted  to  the  whole  series  are  utilised  to  eliminate  the  slow  changes,  the 
spurious  correlations  introduced  by  this  cause  must  be  much  fui-ther  diminished.  In  the 
case  of  the  infantile  death  rates  when  the  correlation  ignored  was  much  greater  than  that 
which  we  sought  to  evaluate,  the  value  obtained  was  raised  only  from  —  0-26  to  —  0-31  ; 
in  the  present  case  we  may  be  certain  that  the  ignored  effects  are  much  smaller  than  those 
evaluated,  and  their  effect,  if  any.  on  the  values  obtained  must  be  extremely  small. 

5.  The  Rothamsted  Rain  and  Wheat  Data. 

The  Rothamsted  wheat  data  employed  in  this  investigation  are  derived  from  Broad- 
balk  field,  which  has  grown  wheat  under  experimental  conditions  since  1844.  Thirteen 
plots  have  been  contiiuiously  under  uniform  treatment  since  1852.  This  series  is  unique 
in  its  length,  which  as  we  have  seen  is  a  most  important  consideration  for  statistical 
purposes.  A  second  feature  of  great  value  is  that  the  yields  are  derived  from  measured 
plots  by  actual  weighing,  and  are  not  derived  from  estimates  based  on  visual  observation 
as  are  the  county  averages  published  by  the  Ministry  of  Agricidtm'e.  An  accoimt  of 
the  variations  in  yield  of  dressed  grain  of  these  13  plots  has  already  been  published 
(5,  Fisher,  1921).  It  was  found  that  the  variations  observable  could  be  divided  into 
three  groups,  ascribable  to  three  separate  causes,  (i)  On  many  of  the  plots  a  pro- 
gressive fhminution  is  observable  owing  to  the  exhaustion  of  the  soil  in  certain  of  the 
essential  plant  nutrients  ;  (ii)  on  all  the  plots  slow  changes  in  yield  have  taken  place, 
which  may  probably  be  ascribed  to  variations  in  the  weed  infestation  of  the  field  as  a 
whole  ;  this  variation,  unlike  that  ascribable  to  other  causes,  is  in  all  the  plots  approxi- 
mately proportional  to  the  mean  yield.  These  two  causes  of  variation  may  be  eliminated 
by  fitting  to  each  series  a  polynomial  of  the  5th  degree.  The  remaining  variation, 
consisting  of  the  deviations  of  the  actual  yield  from  the  smooth  average  value  given 
by  the  polynomial,  is  ascribable  primarily  to  variations  of  the  season  under  which 
the  crop  grew  to  maturity.  Conventionally  this  season  has  been  taken  to  be  the  weather 
for  360  days,  ending  on  August  31st  of  the  harvest  year.  The  first  two  causes  of  variation 
have  already  been  discussed  ;  it  is  the  purpose  of  the  present  paper  to  consider  the 
fluctuations  in  yield  from  year  to  year  in  relation  to  the  rainfall  record. 

Table  II  gives  the  manurial  treatment  of  the  13  plots  considered  together  with  the 
mean  yields  obtained  from  them,  and  the  mean  rate  of  deterioration.  In  Table  III  are 
shown  the  deviations  in  each  year  from  the  corresponding  polynomial  value  ;  it  will  be 
observed  that  (certain  years  have  been  omitted  as  not  available  for  correlation  with 
rainfall.  The  rainfall  record  conmienced  in  February,  1853,  and  is  therefore  comiilete 
for  the  crop  harvested  in  1854  and  subsequently.  In  1889  and  1890  the  field  was 
partially  fallowed  by  halves,  and  the  crops  in  1890  and  1891  following  the  fallow  were 
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beneficially  affected  ;  tlie  same  is  true  of  the  harvests  of  1905,  1906  and  also  of  1915. 
It  has  been  possible  to  use  the  value  for  1916,  since  in  this  case  the  two  halves  of  the 
field  were  harvested  separately,  and  the  half  fallowed  in  1915  could  be  rejected.  With 
these  omissions  it  is  believed  that  the  series  of  crop  yields  from  this  field  are  as  homo- 
geneous a  series  as  it  would  be  possible  to  obtain  ;  it  will  be  seen  that  60  years  are  avail- 
able for  correlation  with  rainfall. 

Table  II. 


Plot. 


2b 

3&4 

5 

6 

7 

8 

10 

11 

12 

13 

14 

17 

18 


T 


Manure  per  acre. 


Sulphate      Sulphate 

of  of 

potash.  soda. 


Sulphate 
of 

magnesia. 


Super- 
jihosphate. 


Sulphate 

of 
ammonia. 


Chloride 

of 
ammonia. 


Mean 
(Bushels 
per  acre). 


Mean 

aimual 

diminution 

(Bushels 

per  acre). 


lb. 

lb. 

lb. 

lb. 

lb. 

lb. 

Dung 

14  tons 

— 

— 

— 

— 

34-549 

No  Ma 

nure 

— 

— 

— 

— 

12-269 

200 

100 

100 

392 

— 

— 

14-180 

200 

100 

100 

392 

100 

100 

22-581 

200 

100 

100 

392 

200 

200 

31-367 

200 

100 

100 

392 

300 

300 

35-694 

— 

— 

— 

— 

200 

200 

19-504 

— 

— 

— 

392 

200 

200 

22-046 

— 

366i 

— 

392 

200 

200 

28-319 

200 

— 

— 

392 

200 

200 

30-209 

— 

— 

280 

392 

200 

200 

27-765 

200 

100 

100 

392 

— 

— 

14-510 





— 

— 

200 

200 

29-006 

0-031 
0-097 
0-090 
0-141 
0-144 
0-092 
0-157 
0-219 
0-181 
0-123 
0-231 
0-092 
0-114 


*  Alternate. 


In  February,  1853,  readings  of  rainfall  were  commenced  with  a  large  gauge,  0-001 
acre,  built  for  the  purpose  ;  the  readings  of  this  large  gauge  have  been  consistently  higher 
than  those  of  the  5-in.  and  8-in.  gauges  which  are  at  present  placed  beside  it.  It  may 
be  concluded  that  the  large  gauge  gives  a  better  estimate  of  the  amount  of  rain  falling 
on  the  field.  Daily  readings  are  available  for  the  whole  period,  the  rain  up  to  9  a.m. 
being  ascribed  to  the  previous  day.  The  rainfall  was  divided  as  explained  in  Section  3 
into  61  periods  of  6  days  each,  and  each  set  of  61  values  was  then  analysed  by  calculating 
the  coefiicients  of  the  polynomials  up  to  the  5th  degree.  Thus  the  amount  and  distribu- 
tion of  rain  in  each  season  was  represented  by  a  series  of  6  numbers  ;  such  a  representa- 
tion of  a  complicated  sequence  of  events  might  seem  to  be  insufficient,  but  it  has  been 
shown  in  Section  3,  that  only  such  coefficients  are  required  as  correspond  to  the  regres- 
sion function  ;  since  the  latter  varies  relatively  slowly,  little  would  be  gained  by  following 
in  more  detail  the  rapid  fluctuations  of  the  weather. 

The  computation  of  the  rainfall  coefficients  involved  a  gi-eat  deal  of  labour.  The  method 
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Table  III. 


17  and  18. 

2b. 

3&4. 

5. 

6. 

7. 

8. 

10. 

U 

12. 

13. 

14. 

Min- 

Ammon. 

erals. 

Sulphate. 

1854 

+  9-54 

+  6-86 

+  5-97 

+ 

7-62 

+  11-95 

+  13-98 

+  12-79+15-83 

+  13-16 

+  12-98 

+  12-44 

+ 

6-43 

+  13-23 

-f  0 

45 

+  1 

33 

-  0 

42 

— 

0 

06 

2 

93 

-  5 

65 

-  1 

141-  7 

67 

-  2 

61 

-  2 

80 

-  2 

62 

— 

0 

67 

+  1-04 

1856 

+  0 

15 

-  1 

59 

+  0 

54 

— 

1 

34 

-■  0 

75 

-  0 

03 

-  0 

07|+  0 

91 

-  2 

44 

-  3 

45 

-  0 

99 

— 

1 

66 
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+  3 

81 
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89 

+  4 

63 

+ 

5 

69 
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03 

+  7 

82 

+  5 

25 

+  8 

10 

+  fi 

90 
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98 
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74 

+ 

6 

26 

+  7-02 

1858 

+  0 

47 
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14 

-  0 

08 

1 

05 

-  0 

52 

+  0 

51 

-  1 

53 

+  0 

63 

+  0 

20 

+  0 

38 

+  0 

93 

+ 

1 

39 

-  0-26 

-  2 

45 

+  1 

45 

+  1 

73 

+ 

0 

01 

-  5 

04 

-  7 

39 

-  4 

70 

-  4 

10 

_  2 

84 

-  2 

62 

-  2 

85 

+ 

0 

08 
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52 
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44 

_  2 

64 

7 

61 
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87 
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93 

-  8 

96 
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19 
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56 
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08 

4 

46 

-  8-53 
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04 
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74 

— 

1 

55 
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26 
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94 

-12 

12 

—  6 

74 
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32 
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89 
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52 

— 

1 

20 

-  0-60 

1862 

+  0 

26 

-  0 

06 

+  0 

08 

— 

0 

32 

-  2 

70 

-  2 

20 

-  2 

20 

-  3 

74 

-  2 

99 

-  4 

21 

-  5 

30 

_ 

0 

92 

-  5-37 

+  6 

48 

+   2 

86 

+  2 

50 

+  11 

80 

+  15 

86 

+  14 

56 

+  15 

69 

+  15 

67 

+  17 

88 

+  17 

10 

+  18 

00 

+ 

2 

37 

+  13-66 

1864 

+  3 

20 

+  1 

41 

+  0 

21 

+ 

4 

15 

+  8 

88 

+  9 

38 

+  9 

05 

+  6 

98 

+  9 

23 

+  7 

63 

+  6 

28 

0 

56 

+  4-10 

+  1 

12 

-  0 

79 

-  1 

94 

— 

1 

20 

+  4 

29 

+  3 

84 

+  3 

36 

-  1 

47 

+  0 

07 

+  2 

26 

+  2 

12 

— 

0 

54 

-  0-09 

18R6 

-  2 

53 

-  1 

50 

-  2 

34 

— 

4 

79 

+  5 

01 

-  6 

82 

+  3 

57 

-  0 

05 

-  5 

31 

-  9 

35 

-  5 

63 

— 

4 

18 

-  4-54 

-  6 

82 

-  4 

66 

-  5 

82 

_ 

8 

67 

-11 

75 

—  7 

72 

-  4 

37 

-  5 

23 

-  8 

14 

-  9 

51 

-10 

00 

— 

5 

39 

-  6-79 

1868 

+  8 

25 

+  3 

87 

+  3 

01 

+ 

4 

73 

+  6 

92 

+  9 

20 

+  3 

78 

+  6 

95 

+  8 

13 

+  6 

68 

+  9 

78 

+ 

3 

21 

+  7-58 

+  5 

51 

+  1 

83 

+   1 

61 

1 

26 

-  3 

58 

-  1 

75 

-  2 

08 

-  3 

63 

-  3 

58 

-  4 

49 

-  3 

40 

+ 

1 

29 

-  6-38 

1870 

+  4 

45 

+  3 

11 

+  5 

00 

+ 

8 

26 

+  9 

41 

+  9 

48 

+  1 

32 

+  0 

13 

+  5 

29 

+  5 

93 

+  5 

38 

+ 

4 

78 

+  5-46 

+  7 

55 

-  1 

99 

-  1 

44 

4 

60 

-  8 

10 

-  7 

36 

-10 

48 

-13 

51 

-  8 

08 

-  0 

20 

-  5 

34 

+ 

2 

27 

-  0-02 

1872 

+  1 

43 

-  0 

44 

-  0 

20 

— 

0 

55 

+  0 

01 

+  1 

13 

-  1 

59 

+  3 

36 

+  0 

87 

+  0 

17 

+  1 

38 

0 

24 

-  2-18 

-  3 

80 

+  0 

78 

-  0 

01 

— 

4 

74 

-  7 

18 

-  6 

40 

+  0 

72 

-  4 

01 

-  4 

78 

-  5 

75 

-  4 

25 

_ 

1 

03 

-  7-50 

1874 

+  8 

97 

+  0 

96 

+  0 

52 

+ 

4 

50 

+  10 

76 

+  7 

04 

+  7 

29 

+  10 

09 

+  12 

38 

+  8 

38 

+  8 

76 

+ 

1 

54 

+  5-63 

-  1 

24 

-  2 

59 

-  3 

09 

— 

3 

74 

-  2 

48 

-  3 

21 

-  4 

85 

-  4 

38 

-  1 

31 

-  1 

16 

-  1 

12 

0 

47 

-  1-89 

1876 

-  6 

22 

2 

03 

-  1 

70 

— 

4 

22 

-  4 

69 

-  3 

30 

-  4 

87 

-  7 

59 

-  6 

97 

-  2 

85 

-  5 

00 



1 

55 

-  0-94 

-  6 

05 

-  1 

17 

-  0 

49 

— 

5 

34 

-  8 

20 

-  8 

04 

-  0 

12 

-  3 

71 

-  8 

11 

-  9 

54 

-  8 

27 



1 

86 

-l-t-80 

1878 

-  2 

13 

+  1 

92 

+  2 

55 

+ 

2 

80 

+  3 

15 

+  5 

39 

+  10 

99 

+  8 

27 

+  3 

66 

+  1 

81 

+  5 

45 

+ 

3 

47 

+  1-54 

-14 

69 

-  5 

95 

-  6 

53 

— 

9 

55 

-12 

00 

-12 

25 

-13 

13 

-  9 

94 

-11 

34 

-11 

58 

-10 

29 

8 

65 

-  7-26 

1880 

+  V 

27 

+  1 

54 

+  5 

43 

+ 

7 

15 

+  5 

97 

+  2 

30 

-  5 

25 

+  4 

84 

+  4 

03 

+  5 

34 

+  4 

50 

+ 

3 

12 

+  4-99 

-  1 

35 

+   2 

45 

+  0 

30 

+ 

1 

12 

-  2 

17 

-  2 

58 

+  1 

76 

+  0 

65 

-  1 

56 

+  0 

48 

+  0 

91 

+ 

1 

20 

+  3-93 

1882 

+  0 

58 

+   0 

09 

-  0 

07 

+ 

2 

22 

+  6 

54 

+  3 

44 

+  7 

69 

+  9 

58 

+  9 

28 

+  4 

25 

+  8 

10 

+ 

3 

41 

+  2-69 

+  2 

44 

+  2 

56 

+  2 

96 

+ 

6 

37 

+  6 

36 

+  7 

83 

+  0 

71 

+  5 

68 

+  5 

24 

+  5 

92 

+  6 

37 

+ 

3 

17 

-  0-46 

1884 

-  1 

00 

+  1 

83 

+  2 

52 

+ 

4 

44 

+  8 

16 

+  9 

02 

+  8 

80 

+  11 

35 

+  10 

08 

+  4 

58 

+  9 

26 

+ 

0 

43 

+  4-80 

+  5 

90 

+  2 

63 

+   2 

01 

— 

0 

13 

+  0 

74 

+  1 

76 

+  7 

17 

+  1 

64 

+  1 

46 

-  1 

56 

-  0 

23 

0 

22 

+  3-57 

1886 

+  1 

53 

-  2 

44 

-  2 

15 

+ 

0 

43 

+  3 

84 

+  6 

83 

-  4 

49 

-  3 

51 

+  0 

28 

+  8 

25 

+  3 

68 

_ 

0 

02 

+  7-95 

-  0 

98 

+  2 

48 

+  0 

85 

+ 

0 

26 

-  2 

47 

-  1 

61 

+  4 

24 

+  1 

14 

+  3 

59 

-  3 

16 

+  1 

27 

— 

3 

05 

+  0-48 

1888 

+  1 

54 

-  1 

91 

-  1 

93 

+ 

0 

06 

+  3 

32 

-  1 

36 

-  5 

87 

-  9 

59 

-  3 

14 

+  3 

48 

-  1 

43 



0 

93 

+  1-33 

+  3 

33 

+  0 

18 

+  0 

99 

— 

0 

96 

-  2 

46 

-  1 

77 

-  5 

71 

-  4 

84 

-  2 

95 

-  4 

28 

-  3 

26 



4 

11 

-7-69 

1890 

— 

— 

— 

— 

— 

— 

1892 

-  5-61 

-  3-41 

-  4-46 



2-84 

-  2-57 

-  0-74 

+  7-03 

-  5-42 

-  4-00 

-  2-56 

-  4-31 

2-95 

-  3-03 

-  518 

-  2-28 

-  0-71 

— 

5-79 

-14-55 

-17-45 

-10-31 

-13-48 

-16-97 

-15-29 

-15-64 



3-18 

-11-87 

1894 

+  5-72 

+  4-80 

+  7-33 

+  12-71 

+  13-29 

+  9-41 

+  11-30 

+  17-87 

+  18-61 

+  15-72 

+  15-77 

+  11-57 

+  4-72 

+  3-86 

-  2-50 

-  0-04 

— 

4-22 

-  2-96 

+  0-09 

-  8-76 

-  5-19 

-  5-57 

-  3-49 

-  6-41 

— 

3-26 

-  3-52 

1896 

+  3-86 

+  3-39 

+  5-04 

+ 

4-55 

+  2-03 

+  4-15 

+  3-30 

+  3-05 

+  5-06 

-  4-58 

+  1-11 

+ 

110 

+  3-25 

-  2-88 

-  3-74 

-  2-34 

— 

5-81 

-  6-49 

-  3-20 

-  212 

-  4-84 

-  7-13 

-  4-72 

-  8-55 

4-98 

-  1-95 

1898 

-  1-99 

-  0-43 

-  2-16 

— 

4-21 

-  6-51 

-10-83 

+  1-65 

-  3-70 

-  4-12 

-  5-66 

-  3-50 



0-93 

-  6-27 

+  2-78 

-  0-75 

-  2-18 

— 

5-89 

-  3-28 

-  0-95 

+  3-81 

+  0-92 

-  0-10 

-  4-27 

+  0-88 



2-01 

-5-29 

1900 

-  606 

-  0-28 

-  0-04 

— 

4-96 

-  4-30 

+  4-16 

+  0-77 

-  1-77 

-  4-00 

-  2-72 

-  3-66 



3-78 

-  2-75 

+  0-85 

-  0-51 

-  0-21 

— 

1-44 

-  3-95 

+  2-88 

+  1-76 

+  0-69 

-  1-77 

-  2-47 

-  3-87 

+ 

2-40 

-  2-86 

1902 

+  3-41 

+  1-36 

+  1-05 

+ 

3-15 

+  5-50 

+  6-17 

+  5-00 

+  3-70 

+  5-20 

+  7-70 

+  6-67 

+ 

5-12 

+  5-85 

-  7-60 

-  4-09 

-  5-47 

— 

4-65 

-  5-29 

-  2-66 

-  4-98 

-  3-23 

-10-39 

-  4-71 

-  8-08 

8-75 

-  4-70 

1904 

-14-09 

-  7-00 

-  6-83 

— 

1016 

-11-51 

-12-87 

-  7-81 

-  5-98 

-11-42 

-  9-67 

-  7-14 

- 

10-67 

-  8-99 

1906 







. 

z 

~ 

~ 



+  2-91 

-  0-82 

-  117 

+ 

4-78 

+  5-77 

-  0-34 

+  10-45 

+  15-10 

+  16-01 

+  5-98 

+  14-58 

. 

1-98 

+  3-59 

1908 

+  7-76 

+  2-90 

+  3-96 

+ 

3-76 

+  6-53 

+  13-54 

+  5-04 

+  4-51 

+  7-95 

+  8-00 

+  4-93 

+ 

2-47 

+  6-73 

+  2-90 

-  0-01 

-  1-48 

— 

0-02 

+  3-21 

-  0-56 

-  6-89 

-  9-89 

-  4-63 

+  0-62 

-  4-36 

2-77 

+  4-39 

1910 

-  0-70 

-  1-26 

-  1-46 

+ 

0-93 

+  0-87 

-  4-02 

-  1-71 

+  4-21 

+  2-80 

-  101 

+  3-97 

_ 

1-41 

+  1-76 

+  7-92 

+  3-92 

+  3-65 

+ 

M6 

+  1-76 

+  5-81 

+  7-31 

+  4-67 

+  4-17 

+  4-16 

+  4-86 

+ 

2-82 

+  3-61 

1912 

-  9-77 

-  3-83 

-  5-28 

— 

13-03 

-16-12 

-18-99 

-12-68 

-12-33 

-17-11 

-18-62 

-15-95 

4-73 

—  15-91 

-  5-02 

-  3-10 

-  3-64 

— 

3-89 

-  6-3C 

-  3-26 

-  5-15 

-  3-77 

-  4-00 

-  5-79 

-  2-95 

_ 

3-37 

—  2-76 

1914 

+   3-65 

-  2-51 

-  0-62 

+ 

1-91 

+  o-9e 

+  0-23 

-  2-40 

-  0-49 

+  1-50 

-  4-16 

-  3-85 

- 

511 

-  5-19 

l'.ll(> 

-  6-70 

-  3-70 

-  2-98 

— 

4-60 

-  2-22 

-10-26 

+  4-11 

-  0-32 

-  3-55 

-  2-92 

-  3-81 

5-22 

-  0-94 

-10-99 

-  1-48 

-  1-31 

+ 

2-50 

+  0-17 

+  4-03 

-  1-60 

-  3-42 

-  316 

+  4-39 

-  1-84 

+ 

2-82 

+  2-12 

1918 

+  0-60 

-  0-oa 

-  2-11 

I 

319 

-  1-48 

+  2-05 

-  0-68 

-  0-93 

+  1-32 

-  0-74 

+  1-52 

+ 

2-11 

+  0-39 
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employed  was  that  of  successive  summation,  introduced  by  G-.  F.  Hardy,  By  this 
method  from  a  column  of  numbers  Xp  Xg,  ... ,  a;„  we  obtain  in  succession 

Si  =  a;,  +  ccj  +  ^3  +  ...  +  a;„  =  S  (a;) 

8^  =  nx^-j~{n~  1)3:3 ...  +  2x„_^-\-x^  =  8  {{%-{-  1  —r)x^} 

S.,  =  ^^^x.  +  (^^..+  ...  +3.„_.4-„  =  s{(^^  +  ^^ 

and  so  on.  The  successive  summations  carried  as  far  as  S^  lead  to  very  large  numbers, 
but  the  work  is  straightforward  and  easily  checked.  It  is  not  necessary  to  carry  out 
the  summation  of  the  column  in  one  piece.  It  may  with  advantage  be  broken  in  the 
middle  ;  for  example  in  the  majority  of  the  rainfall  analyses  the  first  29  terms  Were 
summed  as  above,  giving 

z,  =  S  (a;,) 


1 

20 


z,  =  8{{30-r)x,} 
1 

^»  r(30-r)(31-r)     1        , 
2:3  =  b  -^  ^^ -^-^- -'  x^  >  and  so  on  ; 

while  the  remaining  32  values  were  sununed  backwards  from  the  bottom  of  the  column, 
dropjjing  one  term  at  the  end  of  each  summation,  and  so  giving 


30 

Bl  61 

z,'  =  S  {(r  -  30)  I,}  =  -  S  {(30  -  r)  x,} 

^,  ^  g  I(r-30)(p31)  ^_|  ^  g  |(30-r)(31-,)     I  ^_,^, 

:i2    I  1.2  J  30    I  1.2  J 

Taking  now  alternately  sums  and  differences,  we  obtain 

S/  =  z,+z/  =  S(^,) 
1 

S/  =  z,,-3/  =  S{(30-r)a;,} 


so  on. 


S3'  =--  Zs  +  Z3'  =  S  1(3^^  -  ^)  (31  -  >•)  ^.^1  ^^^  go  o^  . 
1   l  1.2  J 

from  which  the  final  sums  of  the  whole  column  may  be  obtained  from  the  equations 

Si  =  s/ 

S3  =  S/  +  32S/ 

S3  =-.  S3'  +  32S;  +  5|^  s/. 
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The  series  Si,  Sg,  S3,  ,  by  whichever  method  it  is  obtained,  is  divided  by  the 

series  of  divisors 

61  .62  61  .  62  .  63 


61, 


1  .2 


1.2.3 


and  so  on, 


yielding  a  series  of  numbers  of  similar  magnitude 

61  .62      ^^  ^    "' 

^     1.2.3     ,,  lin+l-r){n  +  2-r)     ] 
61  .  62  .  63      I  1.2  1 

with  similar  equations  for  d,  e,  and/.  These  may  be  regarded  as  weighted  means  of 
the  series  re,,  ...,x„;  it  will  be  observed  that  the  weights  involve  r,  and  there- 
fore the  time,  up  to  the  5th  degxee.  The  series  is  therefore  equivalent  for  the  pur- 
pose of  evaluating  the  polynomial  coefficients  to  the  first  five  moments  of  rainfall  as 
distributed  in  time.  The  values  at  which  we  have  arrived  are,  however,  more  convenient 
for  computing  purposes  than  the  moments  would  be,  for  the  quantities  required  by  the 
method  of  Section  3  are 


Po  —  S  (xT^)  =~7=  S  (x)  —  \/n  a 


P.  ^  S (xTO  =  V,,(,f_i)  S itx)  =  V"i!,'\  '^  («  -  ^) 


180 


n(/r—  1)  (/i'  —  4) 


S    U 


^Ul 


_       /5n(n  +  l)(H-|-2)  ^^  _  3^      2c)  and  so  on. 

The  factors  under  the  square  root  are  not  necessary  for  the  correlational  work,  so  that 
actually  it  is  only  necessary  to  calculate  for  each  year 


a'  =  a 

h'  =  a~h 

c'  =  a  —  36  -j-  2c 

d'  =za  —  Qh+lQc  —  5d 

e' ^  a  ~  lOb  ~\- 30c  —  35d  +  lAe 

/'  =  a  —  156  +  70c  —  UOci  +  126e  —  42/. 


y. 


(V) 


These  are  the  independent  variates  in  terms  of  which  the  wheat  yield  is  to  be  expressed 
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in  the  form  of  ;i  regression  equation.     The  regression  coefficients  of  the  yield  uiion  these 
will  have  to  be  tlividcd  by  factors  of  the  form 


/(2s +1)  .  n{n +])...  jn  +  s) 
^  in-1)  ...(n-s) 


(m—  1)  ...  {n  —  s) 

in  order  to  give  the  coefficients  a,  of  Section  3,  but  as  in  the  expansion  of  the  regression 
function,  a,  is  multiplied  by 

/         {2s+l).{(-2s)\y 
^  {s\yn{n'~l)  ...(n'-s') 

we  actually  multiply  the  regression  coefficients  by 

(2.^) ! 
{s\yn{u-{-l)  ...  (m  +  s) 

in  order  to  obtain  the  coefficients  of  the  polynomial, 

«■'  +  ..., 
in  the  expansion  of  the  regression  function. 

6.  The  Rain  Distribution  Values. 

The  values  actually  obtained  for  the  quantities  a',  ...,  /',  for  the  65  i)eriods 
ending  August  31st,  1854  to  1918,  are  given  in  Table  IV  ;  for  tabulation  they  have 
been  multiplied  by  1,000  ;  thus  the  value  347  in  the  second  column  shows  that  the 
average  rainfall  for  the  first  period  was  347  thousandths  of  an  inch  every  6  days.  The 
figures  in  the  third  columii  measure  the  average  rate  at  which  rainfall  AVas  increasing  or 
decreasing  during  the  period,  as  indicated  by  a  straight  line  fitted  to  the  recorded  values  ; 
c'  measures  the  parabolic  term  in  the  rainfall  sequence,  and  so  on  taking  more  and  more 
complex  features  of  the  distribution  into  account. 

These  values  are  themselves  of  very  great  interest,  since  the  incidence  of  rainfall  has 
not  previously  been  analysed  in  this  way. 

The  individixal  peculiarities  of  the  successive  seasons  are  brought  out  clearly,  and  it 
is  possible  to  examine  the  sequence  of  years  by  adequate  statistical  methods.  We  may 
fii'st  enquire  whether  the  observed  sequence  accords  with  the  view  that  each  season 
is  an  independent  jsroduct  of  random  causes  under  constant  climatic  conditions,  or 
w'hether  on  the  other  hand  the  sequence  indicates  progressive  changes  in  the  quantity 
and  distribution  of  the  rainfall. 

The  series  of  65  values  of  a'  was  therefore  analysed  by  summation  in  the  same  nuinner 
as  the  61  values  of  the  rainfall  record  of  each  year  had  been  treated  ;  as  an  illustration 
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Table  IV. 


Har- 

Har- 

vest 

a' 

V 

c' 

(/' 

e' 

/' 

vest 

a' 

y 

c' 

iV 

e' 

/' 

Year. 
1854 

« 

Year. 

347 

-   14 

+  33 

-  16 

-  21 

-  1 

1887 

387 

-   82 

-  6 

+  29 

18 

+  14 

1855 

398 

+  90 

+  41 

+  8 

-  35 

-  22 

1888 

500 

+  48 

+  59 

-  20 

— 

10 

-13 

1856 

487 

-  18 

+  12 

+  1 

-  27 

+  35 

1889 

498 

+  72 

-  1 

-  11 

— 

36 

-10 

1857 

404 

-  9 

+  19 

+  15 

-  9 

-  1 

1890 

450 

+  20 

+  35 

-  1 

— 

15 

-12 

1858 

441 

-  69 

+  69 

-  38 

-  12 

+  15 

1891 

383 

+  65 

+  35 

+  22 

0 

+  28 

1859 

412 

+  64 

+  3 

-  14 

-  4 

-  5 

1892 

487 

-  29 

+  70 

+  45 

— 

35 

+  29 

1860 

598 

+  39 

+  35 

-  12 

-  12 

+  11 

1893 

398 

-  38 

+  37 

+  26 

— 

16 

-  4 

1861 

381 

-  8 

+  3 

-  17 

-   24 

+  1 

1894 

488 

—  7 

+  24 

+  30 

— 

8 

+  16 

1862 

451 

+  22 

-  13 

-  13 

-  21 

+  18 

1895 

474 

-  3 

+  66 

+  51 

— 

42 

-11 

1863 

414 

-  21 

+  36 

+  19 

-  7 

+  14 

1896 

399 

-  27 

+  21 

+  32 

— 

4 

+  13 

1864 

331 

-  56 

+  16 

-  24 

+  3 

+  3 

1897 

618 

-131 

+  77 

-  32 

+  55 

-  5 

1865 

454 

+  43 

+  61 

-  30 

+  29 

-  11 

1898 

323 

-  11 

+  9 

-  20 

+ 

4 

-17 

1866 

568 

—  2 

+  3 

+  38 

-  45 

+  8 

1899 

405 

-  20 

-  42 

+  27 

17 

+  5 

1867 

513 

-  14 

+  21 

-  27 

+  7 

-  17 

1900 

514 

-  47 

+  27 

+  35 

+ 

4 

+  2 

1868 

346 

-  11 

+  14 

+  26 

+  42 

-  11 

1901 

406 

-  10 

-  7 

+  31 

+ 

2 

+  11 

1869 

426 

-  44 

-  37 

+  10 

-  3 

-  3 

1902 

382 

+  30 

+  '18 

+  12 

5 

+  2 

1870 

354 

-  65 

+  10 

+  21 

+  9 

-  11 

1903 

520 

+106 

+  28 

-  12 

— 

8 

-16 

1871 

451 

-  7 

+  16 

-  23 

-  32 

-  16 

1904 

517 

-  75 

+  30 

+  32 

+ 

2 

+  7 

1872 

475 

+  8 

+  13 

-  23 

+  5 

-  6 

1905 

428 

+  35 

+  58 

0 

+ 

17 

+  11 

1873 

503 

-  57 

+  6 

+  64 

-  31 

+  2 

1906 

389 

-  37 

-  13 

+  11 

9 

-15 

1874 

357 

-  12 

+  30 

-  10 

-  3 

-  6 

1907 

481 

-  44 

+  13 

+  5 

— 

47 

+  24 

1875 

526 

-  11 

+  42 

-  33 

-  26 

-  42 

1908 

494 

-  5 

+  6 

+  24 

— 

2 

+  38 

1876 

524 

-  80 

+  29 

+  18 

-  8 

+  42 

1909 

419 

+  52 

+  26 

-  26 

— 

5 

-  7 

1877 

648 

-  76 

+  9 

+  22 

+  23 

-  38 

1910 

507 

-  16 

+  16 

+  23 

— 

11 

+  5 

1878 

532 

+  38 

+  10 

+  20 

+  1 

+  35 

1911 

46J) 

-  43 

-  38 

+  25 

— 

35 

+  11 

1879 

674 

+105 

+  37 

+  26 

-  35 

-  3 

1912 

672 

+  18 

+  33 

+109 

^- 

9 

-  2 

1880 

350 

+  39 

+  25 

-  29 

-  7 

-  24 

1  1913 

451 

-  54 

-  35 

+  20 

— 

2 

+  18 

1881 

603 

-  78 

+110 

+  35 

+  41 

+  16 

1914 

419 

-  42 

-  14 

-  10 

+ 

8 

-57 

1882 

530 

-  11 

-  8 

+  5 

-  28 

+  15 

1915 

604 

-  14 

-  36 

+  65 

29 

-55 

1883 

573 

-  81 

+  6 

+  25 

-  39 

-  14 

1916 

582 

-  17 

-  26 

+  51 

+ 

15 

+  14 

1884 

422 

-  53 

+  35 

+  1 

0 

+  8 

1917 

577 

+  54 

+  82 

+  58 

25 

—  2 

1885 

434 

-  20 

-  37 

-  11 

+  9 

0 

1918 

449 

-  18 

+  15 

-  22 

— 

11 

+  23 

1886 

508 

-  83 

+  33 

-  30 

-  13 

0 

of  the  metliod  the  numerical  vahies  are  given  in  Tables  V  and  M  ;  before  summation 
each  a'  was  reduced  by  400,  the  true  value  of  the  mean  being  inserted  in  the  fourth  column. 
The  second  colunm  gives  the  successive  sums,  Sj  to  S^  ;  from  these  the  third  cohunn  is 
derived  by  dividing  by  factors  of  the  form 


(•).5  .  6r)...(e)5  -!-r) 


r=  J,...,  5. 


Thus  the  third  column  gives  the  values  of  a  to/for  tlie  series  a'.  The  fourth  column  gives 
the  corresponding  values  a'  to/'  for  the  series  «',  obtained  by  equations  (V)  ;  while  the 
fifth  column,  obtained  by  multiplying  by  factors  of  the  form 


x/ 


(2r+  1).  65  ...  (05 
64  ...  (65  —  r) 


"Ty 
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are  the  actual  values  of  the  fii-st  five  transformed  co-ordinates  spoken  of  in  Section  4, 

3jS  3/2    T/O  OOq  • 

Table  V. — Analysis  of  sequence  of  values  of  a'. 


— 

Sj  to  Sg. 

a  to/. 

a' to/'. 

X./  to  Xg'. 

1 

2 
3 
4 

5 
6 

4,521 

128,341 

2,603,055 

40,090,879 

500,901,859 

5,313,933,165 

69-55385 

59-832634 

54-3378562 

49-22841040 

44-570118750 

40-528492120 

469-55385 
+9-72122 
-1-26834 
+7-79456 
+2-35049 
-2-62489 

+137-85 

-  23-95 
+182-35 
+  66-31 

-  88-43 

In  an  unchanging  series  the  values  X2',  x^  . . .  vary  about  zero  in  an  approximately 
normal  distribution,  the  standard  deviation  of  which  may  be  obtained  from  that  of  the 
original  series  ;  for 

is  the  sum  of  the  squares  of  the  deviations  of  the  original  series  from  their  mean.  Slow 
changes  in  the  original  series  will  be  indicated  by  high  positive  or  negative  values  in 
X2,  x^,  ...  ,  and  if  such  slow  changes  are  suspected,  it  will  be  better  to  estimate  the 
variance  due  to  random  causes  from 

from  which  the  first  five  values  have  been  omitted.  From  the  sums  of  the  squares  of 
the  deviations  we  may  thus  obtain  a  series  of  values  each  obtained  from  the  last  by 
deducting  the  square  of  the  corresponding  value  xj;  from  each  such  sum  may  be  obtained 
an  estimate  of  the  standard  deviation  due  to  random  causes,  by  dividing  by  the  number 
of  squares  concerned  (degrees  of  freedom),  and  taldng  the  square  root.  Such  estimates 
will  be  equivalent  to  those  derived  from  the  residuals  left  after  polynomials  of  the  first 
to  the  fifth  degree  have  been  successively  fitted  ;  but  the  labom"  of  calculating  the 
polynominal  values  is  avoided. 

Table  VI. 


Degrees  of  freedom. 

Sum  of  squares. 

Mean  square. 

Standard  deviation. 

64 

465,105 

7267-3 

85-25 

63 

446,103 

7081-0 

84-15 

62 

445,529 

7186-0 

84-77 

61 

412,278 

6758-7 

82-21 

60 

407,881 

6798-0 

82-45 

59 

400,061 

6780-7 

82-35 

e2 
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The  diminution  of  the  estimates  of  the  standard  deviation  indicates  that  the  first 
5  values  of  a;'  are  on  the  whole  higher  than  those  which  follow  ;  for  example  a?/  is  more 
than  double  the  standard,  and  suggests  strongly  that  real  changes  are  taking  place  in  the 
rainfall.  To  test  this  more  accurately,  divide  the  sum  of  the  squares  of  the  five  devia- 
tions by  the  mean  square  of  the  remainder,  then 


465105  —  4000G1 

67807 


=  959, 


whence,  entering  Elderton's  table  with  n!  =  6,  we  obtain  P  =  -089.  Thus  a  larger 
value  of  7.2  would  be  obtained  by  chance  only  8  •  9  times  in  a  hundred,  from  a  series  of 
values  in  random  order.  There  is  thus  some  reason  to  suspect  that  the  distribution  of 
rainfall  in  successive  years  is  not  wholly  fortuitous,  but  that  some  slowly  changing  cause 
is  liable  to  affect  in  the  same  direction  the  rainfall  of  a  number  of  consecutive  years. 
Another  way  of  putting  the  same  result  is  that  the  variance  estimated  from  the  residuals 
of  a  polynomial  of  the  5th  degree  is  93-3  per  cent,  of  the  variance  of  the  original  series, 
so  that  some  6-7  per  cent,  of  the  total  variance  observed  in  annual  rainfall  may  be 
ascribed  to  slow  changes,  while  the  remaining  93  •  3  per  cent,  of  the  variance  are  due  to 
the  chance  circumstances  of  each  particular  year. 

We  summarise  below  the  results  of  applying  a  similar  analysis  to  the  rainfall  dis- 
tribution values  a'  to  /' : — 

Table  VII. 


a' 

V 

c' 

d' 

e' 

/' 

Mean 

469-55 

-   11-09 

+  19-22 

+     9-43 

—     8-69 

+    0-57 

X,' 

+137-85 

-  18-62 

-  35-01 

+  86-92 

+     8-78 

-     2-11 

X- 

-  23-95 

+  56-19 

-  33-75 

+     5-45 

-  16-94 

-    0-84 

x^ 

+182-35 

-  35-31 

-  33-96 

+  13-02 

+  15-72 

-  23-80 

^5' 

+  66-31 

+  13-00 

+  45-52 

+    5-76 

-  29-78 

+     1-25 

^e' 

-  88-43 

+  24-69 

+  33-79 

-  20-47 

+  11-50 

+  22-33 

Standard 
Residue 

82-35 

50-43 

30-11 

28-19 

20-76 

19-91 

x" 

9-59 

2-17 

7-42 

10-33 

3-78 

2-70 

P 

0-089 

0-83 

0-19 

0-067 

0-58 

0-74 

1 

It  is  quite  clear,  from  the  values  of  P.  that  no  significant  changes  are  observable  in 
b' ,  c',  e'  and/'.     These  fluctuate  with  large  standard  (le\iations  about  the  mean  values 


ON   THE   YIELD   OF  WHEAT   AT   ROTHAMSTED. 


117 


given  in  the  first  line  of  Table  VII,  whicli  mean  values  have  not  significantly  changed 
during  the  65  years  examined.  With  «',  and  more  clearly  with  d',  the  mean  appears 
to  show  gradual  changes.  In  the  latter  case  the  change  observable  is  very  clear,  and  of 
a  simple  character.  The  value  of  x^',  representing  a  linear  increase,  exceeds  the  standard 
deviation  in  the  ratio  3-083;  the  probability  of  such  a  value  occurring  by  chance 
is  only  0-00205.  The  remaining  values,  x.^'  to  x/,  are  individually  and  collectively 
insignificant,  whence  it  appears  that  the  slow  change  in  d'  is  significant,  and  may  be 
represented  by  a  uniform  increase.  This  is  in  sharp  contrast  to  the  behaviour  of  a' , 
a  contrast  which  is  brought  out  by  plotting  successive  10  year  means  of  these  two 
quantities. 


I860 


1870 


1880 


1890 


1900 


1910 


1920 


Fig.  1. 


In  the  case  of  the  total  rainfall  (fig.  1)  measured  by  a',  there  would  seem  to  have  been  an 
excess  of  rain  for  a  series  of  years  about  1879,  and  a  second  spell  of  about  equal  intensity 
about  1914  ;  the  interval  being  35  years.  At  these  periods  the  annual  rainfall  average 
rises  to  about  32  inches.  Prior  to  the  first  wet  spell,  and  in  the  intermediate  period, 
the  average  annual  rainfall  is  about  27  inches.  Such  a  change  is  small  compared  to  the 
annual  fluctuations  ;  we  have  seen  that  it  accounts  for  only  about  6-7  per  cent,  of  the 
variance,  and  that  the  whole  effect  is  scarcely  significant  in  our  data.  The  distribution 
value  d',  on  the  other  hand,  which  measures  not  the  total  rainfall  but  its  distribution 
over  the  year,  shows  a  distinct  and  aj^parently  uniform  increase.  Judging  from  the 
straight  line  (fig.  2),  its  mean  value  has  changed  from  —  10  in  1854,  to  nearly  +  29  in 
1918  ;  its  mean  value  +  9-43,  given  above,  is  thus  not  a  permanent  feature  of  the  climate 
of  the  district.  As  in  all  rainfall  features  the  annual  fluctuation  is  very  great ;  of  the 
variation  observed  in  d'  in  the  65  years  only  12-3  per  cent,  is  ascribable  to  the  linear 
change,  the  remaining  87-7  per  cent,  being  apparently  due  to  random  fluctuations. 
In  the  absence  of  a  similar  analysis  of  rainfall  at  other  stations,  it  would  be  premature 
to  discuss  the  possible  causes  of  this  remarkable  and  progressive  change  in  the  climate. 
It  may  be  remarked  that  such  little  additional  information  as  is  to  be  obtained  from  the 
monthly  records  indicates  that  the  most  marked  feature  of  the  change  in  progress  is  an 
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Ten  year  means  of  the  distribution  value  <V . 
increase  in  the  December  rainfall  with  perhaps  some  relative  reduction  of  rain  in  Spring 
and  Autumn. 

The  mean  values  of  c'.  d! ,  and  e',  differ  significantly  from  zero,  owing  to  the  seasonal 
variations  of  rainfall.  The  series  of  mean  values,  neglecting  the  mean  value  of/'  which 
is  altogether  negligible,  represent  the  mean  course  of  the  rainfall  sequence  during  the 
year.  This  is  represented  in  fig.  3,  where  the  mean  monthly  values  have  been  inserted 
for  comparison  ;  these  values  have  been  reckoned  fer  day  to  eliminate  the  effect  of  the 
varying  lengths  of  the  months.  The  monthly  means  are  of  course  subject  to  large 
probable  errors,  and  the  smooth  curve  gives  a  slightly  better  estimate  of  the  average 
sequence  ;  this  sequence  is  necessarily  in  process  of  modification  owing  to  the  j^ro- 
gressive  increase  of  d'. 

The  standard  deviations  of  the  six  distribution  values  are  in  good  mutual  agreement 
ajid  evidently  arise  from  a  common  cause,  namely  the  random  fluctuation  of  rain  in  a 
six-day  period.  'I'his  may  be  seen  by  multiplying  them  by  the  factors  necessary  to 
transform  a! /'  into  p,,  p,.  ••••  Ph  (Section  5),  that  is  by 


v'eT.  V?^*- 
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SEASONAL  VARIATION  IN  AVERAGE  DAILY   RAINFALL 
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Fig.  3. 

the  standard  errors  of  pi,  ...  ,  p^  are  then 


ffp,. 
643 


693 


552 


643 


573 


660 


Mean. 
627 


These  may  be  regarded  as  independent  estimates  of  the  stantlard  deviation  of  the 
rainfall  (in  thousandths  of  an  inch)  in  a  six-day  period,  each  derived  from  59  independent 
squares,  which  is  equivalent  to  a  random  sample  of  60  values.  The  variation  of  these 
estimates  is  just  of  the  order  to  be  expected  on  the  basis  of  random  samjiling.     The 


c'. 

d'. 

e'. 

/'• 

Standard 
error. 

+  0-232 

+  0-566 

+  0-614 

-0-546 

±  0-306 

+  0-354 

+  0-510 

+  0-920 

+  1-006 

±  0-612 
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variation  of  the  rainfall  distribution  values  is  thus  nearly  what  we  should  deduce  from 
the  supposition  that  in  addition  to  the  small  contributions  made  by  secular  and  seasonal 
variation,  each  6  day  period  varied  independently  with  a  standard  deviation  0  •  627  inch, 
or  a  coefficient  of  variation  of  134  per  cent. 

The  distribution  of  the  distribution  values  are  not  materially  skew  as  far  as  can  be 
judged  from  a  series  of  65  values.  The  following  values  obtained  from  the  moments 
of  the  sample  shoAV  the  observed  deviations  from  normality,  with  the  standard  errors 
appropriate  to  a  normal  jiopulation 

a'.  v. 

y,  =  yp;  +  0-493     +  0-336 

Y^  =  p,-  3        —  0-290     +  0-066 

These  figures  suffice  to  show  that  the  observed  values  hardly  differ  significantly  from 
samples  of  normal  distributions,  although  there  are  indications  that,  if  the  values  for  a 
considerably  longer  series  of  years  were  available,  real  departures  from  the  normal 
distribution  would  become  apparent. 

7.  Correlations  of  Rainfall  Distribution  Values. 

The  rainfall  distribution  values  have  been  calculated  from  uncorrelated  functions 
of  the  time,  and  if  the  rainfall  at  different  periods  of  the  year  were  uncorrelated  the  rain- 
fall distribution  values  should  also  be  uncorrelated.  If,  on  the  other  hand,  rainfall 
at  one  time  of  the  year  were  associated  with  rainfall  or  lack  of  rainfall  at  another  time, 
then  correlations  would  appear  in  the  rainfall  distribution  values,  and  such  correlations 
would  indicate  the  nature  and  extent  of  such  mutual  correlations  between  different  parts 
of  the  season.  Hooker  observed,  in  the  short  period  of  21  years  at  his  disposal,  a  correla- 
tion as  high  as  +  0-6  between  the  winter  rainfall  (1st  to  8th  weeks)  and  that  of  the  pre- 
vious autumn  (37th  to  44th  weeks),  and  concluded  that  so  large  a  correlation  indicated  a 
real  interdependence  between  winter  and  autumn  rainfall.  More  recently  W.  T.  Russell 
(13.  1922)  has  calculated  the  correlation  coefficients  for  monthly  rainfall  between  con- 
secutive and  between  alternate  months.  For  Greenwich  (65  years)  no  appreciable  corre- 
lations are  found  between  successive  months,  but  in  the  alternate  months  it  is  found 
that  June  and  August  have  a  positive  correlation  +0-55.  Such  a  value  should  occur 
by  chance  from  uncorrelated  material  once  only  in  900,000  times,  so  that  even  allowing 
for  the  fact  that  it  is  the  highest  of  the  24  values  obtained,  it  would  appear  to  be  sig- 
nificant of  a  real  association.  It  should,  however,  be  borne  in  mind  that  any  secular 
changes  either  in  total  rainfall,  or  in  its  distribution  tlirough  the  year,  as  may  exist  at 
Greenwich,  have  not  been  eliminated  in  this  coefficient,  and,  in  the  second  place,  the 
distribution  of  monthly  rainfall  is  far  from  normal,  and  in  consequence  the  probable 
error  of  such  a  determination  may  be  distinctly  higher  than  that  calculated  for  normal 
distributions. 
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The  correlations  obtained  between  tlie  G5  values  a',  ...  ,/'  are  not,  in  any  case, 
high.  The  transformed  values  (z  =  tanh~'  r)  are  set  out  in  Table  VIII,  since  these 
values  are  distributed  in  random  samples  in  approximately  normal  distributions,  with 
the  same  standard  error  ±  0-127. 

Table  VIII. — Values  of  z  =  tanh"'  r. 


a'. 

b'. 

c'. 

d'. 

e'. 

V 

-0-01 

_ 

c' 

+0-19 

+0-09 

— • 

' — ■ 

— 

d' 

+0-38 

-0-07 

-0-09 

— 

— 

e' 

-0-06 

-0-23 

+0-18 

-0-16 

— 

f 

-0-03 

-0-08 

+0-08 

+0-16 

-0-06 

None  of  these  correlations  excite  remark  except  that  between  a'  and  d' ;  these  two 
variates  are  the  two  which  show  secular  variation,  and  it  is  therefore  of  more  interest 
to  obtain  the  corresponding  values  after  the  secular  variation  has  been  removed.  These 
are  found  by  treating  the  sum  of  the  products  of  deviations  as  the  sum  of  squares 
has  been  treated  in  Table  VI,  deducting  successively  the  products  of  the  values 
X2.  ... ,  xj,  for  the  two  variates  concerned,  calculating  r  from  the  sums  of  squares 
similarly  treated,  and  transforming  to  the  z  scale  as  before.  The  values  so  obtained  are 
shown  in  Table  IX,  with  standard  error  ±  0  •  1325. 


Table  IX. — Values  of  z  after  Eliminating  Secular  Change. 


a'. 

b'. 

c'. 

d'. 

e'. 

v 

+0-0342 

_ 

c' 

+0-2880 

+0-0800 

— 

— 

— 

d' 

+0-3103 

-0-0526 

-0-0228 

— 

— 

e' 

-0-0784 

-0-2100 

+0-2380 

-0-1955 

— 

r 

+0-0356 

-0-1119 

+0-0381 

+0-2092 

-0-0525 

The  correlation  of  a'  with  d'  now  exceeds  its  standard  error  in  the  ratio  2  •  34  ;  the 
probability  of  obtaining  so  large  a  value  by  chance  from  uncorrelated  material  is  0-019, 
and  since  this  is  the  largest  value  of  15,  it  can  scarcely  be  regarded  as  proof  of  associa- 
tion.    To  test  the  values  collectively  from  the  series  of  their  squares,  we  obtain 


7.2  =  22-63, 


n'  =  16, 


P  =  0-093, 


from  Elderton's  Table,  showing  that  although  there  are  signs  of  association  among  the 
rainfall  distribution  values,  such  association,  if  it  exist,  is  not  strong  enough  to  show  up 
significantly  in  a  series  of  about  60  values. 
vol.  CCXIII. — B.  s 
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8.  The  Regression  of  Yield  on  the  Distribution  Values,  and  on  the  Rainfall  at  Different 

Seasons. 

The  sums  of  squares  and  products  of  the  distribution  values  provide  a  basis  for  the 

calculation  of  the  partial  regression  coefficients  of  the  yields  upon  them,  but  before 

applying  them  in  this  way  it  is  necessary  to  allow  for  the  fact  that  5  years'  yields,  namely, 

1890,  1891,  1905,  1906  and  1915,  have  been  omitted  ;    we  must  therefore  deduct  from 

the  sums  the  contributions  of  these  years.     Table  X  shows  the  polynomial  values  of  «', 

b',  ...  ,/'  for  these  years  : — 

Table  X. 


Year. 

t. 

a'. 

b'. 

c'. 

d'. 

e'. 

/'- 

1890 

4 

465-64 

-13-21 

33-17 

9-11 

-  9-73 

4-16 

1891 

5 

461-66 

-12-36 

33-36 

9-10 

-  9-42 

4-84 

1905 

19 

458-63 

-11-96 

10-57 

18-88 

-  6-03 

2-00 

1906 

20 

463-62 

-12-52 

7-98 

20-29 

-  6-17 

0-97 

1915 

29 

524-83 

-  8-98 

2-73 

30-09 

-10-66 

-  4-13 

calculated  from  the  polynomial  coefficients  of  Table  VII.  These  values  were  subtracted 
from  the  observed  values  of  Table  IV,  and  the  squares  and  products  of  the  deviations 
deducted  from  the  corresponding  sums.  The  corrected  sums,  answering  now  to  54 
degrees  of  freedom,  were  as  follows  : — 

Table  XI. 


— 

a'. 

b'. 

c'. 

d'. 

e'. 

/'- 

a' 

+380,853 

+  14,984 

+44,122 

+38,013 

-  5,278 

+  8,121 

V 

+  14,984 

+  140,110 

+  4,031 

-  4,241 

-14,577 

-  8,921 

d 

+  44,122 

+    4,031 

+49,.302 

+      906 

+  6,746 

-  1,392 

d' 

+  38,013 

-    4,241 

+      906 

+44,944 

—  5,790 

+  8,163 

e' 

-    5,278 

-  14,577 

+  6,746 

-  5,790 

+24,429 

-  2,769 

r 

+    8,121 

-    8,921 

-  1,392 

+  8,163 

-  2,769 

+19,670 

Since  we  require  to  find  the  partial  regressions  for  13  separate  plots,  it  is  wortli  while 
to  invert  the  determinant  formed  of  these  nmiibers,  so  as  to  obtain  a  matrix  of  multipliers 
each  of  which  is  the  co-factor  of  the  corresponding  nmnber  above,  divided  by  the  deter- 
minant,    'i'able  XII  shows  these  multipliers  in  millionths. 


Table  XII. 

— 

a'. 

b'. 

c'. 

d'. 

e'. 

/. 

a 

+3-235054 

-0-304445 

-  2-947149 

-  2-535496 

+  0-672526 

-  0-542793 

b 

-0-304445 

+  8-094433 

-  1-082929 

+  1-042.372 

+  5-775040 

+  4-100483 

c 

-2-947149 

-1-082929 

+24-070588 

+  0-730487 

-  7-637592 

+  1-050675 

d 

-2-53.5496 

+1-042372 

+  0-730487 

+26-747433 

+  5-214984 

-  8-794578 

e 

+0-672526 

+5-775040 

-  7-637592 

+  5-214984 

+48-604978 

+  6-478923 

f 

-0-542793 

+4-100483 

+  1-050675 

-  8-794578 

+  6-478923 

+57-557904 
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For  each  plot  six  correlation  tables  were  constructed  with  the  six  rainfall  distributions 
values,  using  the  values  in  Tables  III  and  IV,  corresponding  to  the  60  years  available. 
The  six  sums  of  products  obtained  from  these  tables,  multiplied  by  the  values  of  any 
column  of  Table  XII  and  added,  give  the  regression  of  yield  on  the  coiTesponding  rain 
variate.  Table  XIII  gives  the  values  obtained  for  these  regression  coefficients,  the  crop 
being  measured  in  bushels  per  acre. 

Table  XIII. 


Plot. 

Regression  on — 

a'. 

b'. 

c'. 

d'. 

e'. 

/'- 

2b 

-  40-9895 

+ 

2-1969 

8-0261 

46-2368 

15-8457 

+  24-0548 

3+4 

-  20-2933 

0-0582 

+ 

0-1506 

+ 

13-7263 

+ 

14-2433 

+  27-0821 

5 

-  20-7766 

— 

3-1878 

0-0575 

+ 

13-9806 

+ 

7-2414 

+  18-8043 

6 

-  39-7077 

— 

0-6177 

+ 

34-1075 

— 

5-7556 

— 

8-3922 

+  51-2116 

7 

-  42-4430 

— 

3-1822 

-t- 

36-3066 

— 

31-9965 

— 

16-6956 

+  83-5923 

8 

-  43-8350 

— 

11-7970 

+ 

35-9850 

— 

31-6292 

+ 

8-1368 

+110-1433 

10 

-  25-7070 

— 

20-2730 

15-3172 

— 

15-6733 

+ 

35-8060 

+140-2458 

'11 

-  30-4546 

— 

14-8637 



9-8165 

+ 

1-6230 

+ 

17-7579 

+113-9780 

12 

—  55-9573 

— 

13-9676 

+ 

20-9149 

— 

22-4509 

— 

23-0060 

+113-0384 

13 

-  45-1447 

— 

3-9169 

+ 

63-5960 

— 

48-0067 

— 

24-4347 

+  92-2415 

14 

-  42-6026 

— 

12-9980 

+ 

25-9731 

— 

30-7156 

— 

24-8636 

+115-5578 

17&18\ 
miner'ls/ 

-  24-0377 

+ 

0-5955 

+ 

17-2943 

+ 

5-6818 



3-8642 

+  33-4268 

17&18\ 

-  42-6939 

+ 

3-8539 

+ 

60-9060 



44-3881 

+ 

2-7304 

+120-5257 

am'oniaj" 

These  coefficients  give  directly  linear  regression  formulae  expressing  the  deviation 
of  the  wheat  crop  on  each  plot  in  terms  of  the  rainfall  distribution  values.  Equally,  as 
explained  in  Section  3,  they  enable  us  to  estimate  the  average  benefit  or  loss  in  bushels 
per  acre  ascribable  to  an  additional  inch  of  rain  at  any  time  dming  the  year.  For  this 
purpose  we  divide  the  six  regressions  corresponding  to  any  plot  by 


(2r) 


61  .  62  ...  (61  +r), 


and  we  have  the  six  coefficients  of  t',  t,  f  ...  in 

a  =  A  +  B<  +  C(f-«2)+..., 
expressing  a  in  terms  of  the  polynomials  of  Section  3,  wherein  t  is  the  time  in  6-day 


intervals  measured  from  the  central  period  of  the  year, 
the  function  a  throughout  the  year. 


Figs.  4  to  8  show  the  course  of 


9.  Discussion  of  Figs.  4  <o  8. 
It  shoidd  be  emphasised  that  the  information  provided  by  a  comparison  of  the  rain 
record  \vith  the  subsequent  yields  tells  us  the  efEect,  not  of  so  much  rain,  as  such,  but  of 

s2 
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the  total  meteorological  ptenomena  in  fact  associated  with  rain,  at  the  time  of  the  year 
considered.  Thus  in  our  records  rain  is  associated  with  lower  temperatures  in  sunmier, 
in  winter  with  higher  temperatures  ;  and  generally  with  diminished  sunshine.  The 
efEects  of  these,  to  the  extent  in  which  they  are  associated,  will  be  incorporated  in  the  total 
effects  shown  in  the  charts,  and  are  in  fact  an  integral  part  of  the  value  of  a  rain  record 
as  a  means  of  foreseeing  the  prospects  of  the  crop. 

Even  in  the  case  of  the  rainfall  itself,  however,  a  detailed  consideration  of  the  ways 
in  which  it  may  afEect  the  crop  would  lead  to  a  most  intricate  discussion.  It  would 
be  of  the  greatest  value  to  know  how  important  to  the  final  crop,  and  how  frequently 
influential,  are  not  only  the  actual  moisture  available  in  the  soil,  but  also  the  degree  of  soil 
aeration  ;  how  frequently,  and  to  what  extent,  root  development  is  hindered  by  soil 
saturation,  or  by  an  accumulation  in  toxic  concentration  of  carbon  dioxide.  We  shoidd 
expect  these  factors  to  be  intimately  connected  with  rainfall,  both  in  its  direct  effect 
in  supplying  fully  aerated  water,  and  in  its  indirect  efEects  upon  the  soil  texture. 

In  a  single  curve  showing  the  average  effect  of  rainfall  (and  of  the  average  weather 
associated  with  such  rainfall)  upon  the  crop  ultimately  produced,  all  such  contributory 
causes  are  included  ;  by  the  comparison  of  the  curves  obtained  from  different  plots, 
representing  different  manurial  conditions  of  the  soil,  we  may  infer  the  effect  of  rainfall 
upon  the  availability  of  the  manurial  constituents  of  the  soil.  As  will  appear  more  fully, 
the  predominant  feature  of  such  a  comparison  is  the  influence  of  excessive  rain  in  removing 
soluble  nitrates  ;  this  effect  masks  and  overshadows  all  others,  partly,  I  would  suggest, 
by  reason  of  its  intrinsic  importance  to  the  wheat  crop  in  our  wet  climate,  partly  because 
the  plots  on  Broadbalk  show  great  extremes  in  the  relative  abundance  of  available 
nitrogen. 

The  predominance  in  these  curves  of  the  effects  which  appear  to  be  ascribable  simply 
to  the  removal  of  soluble  nitrate  may  perhaps  explain  two  features  which  otherwise 
might  be  unexpected.  In  the  first  place  the  greater  part  of  the  effect  of  rainfall  is 
expressible  by  means  of  the  linear  relations  in  the  manure  here  represented  ;  the  quadratic 
terms,  though  of  great  interest  and  well  worth  further  study,  are  of  much  less  quantitative 
importance.  Considering  merely  the  effects  of  rain  on  the  general  environment  of  the 
growing  plant  we  might  expect  definite  optimum  conditions  throughout  the  year  to  be 
well  marked,  with  correspondingly  important  quadratic  terms ;  but  the  effect  of  nitro- 
genous fertilisers  is  not  only  approximately  linear  over  the  range  concerned,  but  since 
increasingly  heavy  drainage  generally  removes  decreasing  quantities  of  nitrates,  the 
tendency  of  its  effect  is  to  reverse  the  curvature  of  the  regression.  In  the  second  place 
it  might  have  been  expected  that  the  effects  of  rainfall  woidd  be  closely  related  to  the 
total  crop  on  the  different  plots,  and  that  resemblance  woidd  be  more  clearly  apparent 
in  the  proportionate  than  in  the  absolute  effects  ;  but  since  for  a  wide  range  of  manurial 
condition  the  absolute  effects  of  additional  nitrogen  are  not  very  unequal,  and  tend  to 
decrease  with  increasing  crop,  it  is  apparent  that  the  similarities  in  the  absolute  effect, 
which  we  observe  for  example  in  plots  8  and  14,  are  more  probably  due  to  quantitative 
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changes  in  nutrition  than  to  indirect  modifications  of  the  physiological  condition  of  the 
crop. 

As  will  be  seen  from  Table  II,  the  13  plots  studied  may  be  arranged  in  the  following 
classes  : — 

(i)  Low  fertility,  no  added  nitrogen.      Plots  (3  and  4),  5  and  (17  and  18)  mineral 

series, 
(ii)  Progressively  higher  yields  due  to  progressive  increase  in  nitrogenous  dressings, 

from  plot  6,  through  plots  7,  13  and  (17  and  18)  ammonium  series,  to  plot  8. 
(iii)  Dressings  which  have  produced  an  increasingly  unbalanced  nutrition  with 

diminution  of  yield.     Plots  12  and  14. 
(iv)  Unbalanced  nutrition  produced  earlier  and  more  intensely.     Plots  10  and  11. 
(v)  Farmyard  manure  containing  the  very  heavy  contribution  of  about  200  lb. 

of  nitrogen,  with  organic  matter  which  has  materially  lightened  the  soil. 
An  examination  of  these  diagrams  shows  how  intimately  the  response  of  the  crop  to 
weather  is  connected  with  the  manurial  condition  of  the  soil.  Classing  the  plots  solely 
by  inspection  of  the  curves  of  response  to  rainfall  we  shall  put  together  every  case  in 
which  the  manurial  treatment  is  alike,  and  indeed  the  whole  series  of  curves  arrange 
themselves  in  sequence  of  order  of  increasing  abundance  of  nitrogenous  fertilisers.  As 
preliminary  to  a  fuller  discussion  we  may  note  (i)  that  in  all  the  plots  the  average  effect 
of  additional  rain  is  harmful.  This  agrees  with  Hooker's  finding  for  Eastern  England, 
(ii)  In  all  save  the  dunged  plot  2b,  the  average  loss  is  rapidly  reduced  during  August, 
an  observation  which  finds  a  simple  explanation  in  the  fact  that  the  average  date  of  carting 
the  crop  is  August  24th,  so  that  as  August  advances  an  increasing  proportion  of  years 
occur  in  which  the  rain  is  too  late  to  affect  the  crop.  The  exceptional  behavioiu  of 
plot  2b  indicates  that  the  average  loss  per  inch  of  rain  in  the  month  preceding  harvest 
is  even  heavier  than  that  shown  in  the  diagram,  (iii)  In  all  the  plots  October  is  a  month 
in  which  the  average  loss  per  inch  of  rain  is  small,  or  in  which  rain  above  the  average  is 
230sitively  beneficial.  This  is  the  reverse  of  the  condition  found  by  Hooker,  who  finds 
the  greatest  negative  correlation  with  rain  early  in  October,  (iv)  In  all  plots  save  two, 
the  unmanured  plot  (3  and  4)  and  plot  5  which  receives  mineral  (non-nitrogenous) 
manures  only,  the  autumn  period  of  benefit,  or  but  little  loss,  from  rain,  is  followed 
by  a  period  centred  in  January  in  which  dry  conditions  appear  to  be  particularly  desirable. 
At  this  time  of  the  year  each  additional  inch  of  rain  costs  from  one  to  two  bushels  in 
the  crop.  That  this  effect  is  scarcely  visible  on  the  unmanured  plot,  and  still  less  so 
on  plot  5,  speaks  strongly  in  favom-  of  Sir  John  Lawes's  view  that  the  damage  done 
by  winter  rain  was  principally  occasioned  by  the  washing  out  of  nitrates  from  the  soil. 
Plot  (3  and  4)  and  especially  plot  5  can  have  little  to  lose  in  this  way,  and  possibly  rely 
for  the  greater  part  of  their  growth  upon  nitrates  produced  by  bacterial  activity  as  the 
soil  grows  warmer  in  March,  April  and  May.  The  same  view  is  confirmed  by  the  fact 
that  among  the  remaining  plots  the  loss  due  to  winter  rain  is  least  in  plots  2b,  10  and  11, 
all  of  which  are  characterised  by  the  low  proportion  utilised  of  the  nitrogen  supplied. 
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If  these  plots  seldom  suffer  from  lack  of  nitrogen,  its  loss  during  the  winter  must  be 
exjaected  to  affect  the  crop  relatively  little  ;  nevertheless  the  actual  average  loss  of 
about  one  bushel  to  the  inch  of  rain  shows  that  at  this  time  {i.e.,  before  the  spring 
dressings  are  applied  to  plots  10  and  U,  and  perhaps  before  bacterial  activity  can  fidly 
tap  the  resources  of  the  farmyard  maniire),  a  temporary  shortage  sometimes  injures  the 
crop.  With  these  preliminary  observations  we  may  pass  to  the  consideration  of  the  types 
of  rainfall  response  curve  actually  found. 

Type  I. — Plot  5,  mineral  manures  only,  and  plot  (3  and  4),  umiianured.     Fig.  4  shows 
plot  5. 

+  -4 


+  -2 


C8 


P^ 


to 


BO 


-    2 


-  -4 


-  -e 


-  -8 


-1-0 


-1-2 


-1-4 


-1-6 


1 

/ 

/ 

^^ 

■v^ 

/ 

/ 

*s 

^^ 

-y 

/ 

Plot  5 

SEP.      OCT.      NOV.      DEC.      JAN.      FEB. 

Fig.  4. 


MAR.      APR.      MAY     JUNE    JULY        AU6. 


In  these  two  plots  the  crop  is  severely  limited  by  the  lack  of  available  nitrogen.  Plot  5 
has  a  mean  yield  of  14- 18  bushels  per  acre,  while  plot  (3  and  4)  yields  12-27.  These  are 
averages  (1852-1918)  covering  the  whole  period  over  which  the  rainfall  regressions 
have  been  calculated.  Both  plots  show  heavy  deterioration  over  this  period. 
The  small  difference  in  the  mean  yields,  which  is  nearly  constant  over  the  whole  period, 
shows  that  lack  of  nitrogen  is  the  dominant  limitation  in  plot  (3  and  4)  as  in  plot  5  where 
the  limitation  due  to  lack  of  nitrogen  must  be  most  stringent. 

In  plot  5  the  rain  experienced  is  in  excess  of  the  optimum  probably  at  all  times  of 
the  year.  The  average  loss  for  additional  rain  falls  from  about  0-4,  in  the  September 
before  sowing,  to  a  minimum  of  0-22  (1-5  per  cent.)  towards  the  end  of  October,  and 
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remains  steadily  near  to  0-30  till  the  end  of  May.  It  falls  to  a  maximum  loss  of  0-55 
(3-9  per  cent.)  early  in  July  and  rapidly  diminishes  through  August.  Since  the  pro- 
portion of  drainage  through  the  twenty-inch  gauge  decreases  from  nearly  80  per  cent, 
in  the  winter  to  about  25  per  cent,  from  April  onwards,  it  follows  that  if  the  main  loss 
on  this  plot  due  to  rainfall  is  ascribed  to  the  wa.shing  out  of  nitrates,  the  loss  due  to  an 
equal  amount  of  drainage  is  increasingly  serious  as  the  year  advances.  This  accords 
with  the  belief  that  nitrification  becomes  more  active  in  the  Spring,  and  that  this  plot 
relies  largely  for  its  nitrogen  supply  upon  bacterial  activity  at  this  season.  The  curve 
even  suggests  the  possibility  that  under  the  intense  nitrogen  starvation  of  this  plot 
the  sujjply  of  nitrates  is  of  importance  as  late  as  the  beginning  of  July,  though  it 
must  be  borne  in  mind  that  other  effects  of  rain  may  become  important  during  the 
summer. 

The  curve  for  plot  (3  and  4)  is  closely  similar.  September  damage  of  about  0-4  is 
followed  by  a  minimum  of  0-23  (1-9  per  cent.)  in  October.  There  is  slight  evidence 
of  a  maximum  of  damage  in  the  winter  of  0-37  (3-0  per  cent.)  followed  by  a  minimum 
damage  of  0-24  (2-0  per  cent.)  at  the  end  of  March  ;  this  feature  becomes  marked  in 
plots  in  which  the  nitrogen  limitation  is  less  acute,  and  is  perhaps  an  indication  that  this 
condition  is  less  extreme  in  the  unmanuxed  plot  than  in  plot  5.  The  maximum  damage 
in  early  summer  is  0  •  58  (4  •  7  per  cent.)  ;  little  importance  can  be  attached  to  the  positive 
values  of  the  last  12  days  of  August,  save  as  in  indication  that  August  rain  is  inoperative 
after  the  first  week  of  the  month. 

Type  II. — Plot  6,  complete  minerals  with  single  dressing  of  ammonium  salts  applied 
half  in  Autumn  and  half  in  Spring  ;  plot  17  and  18  mineral  series,  receiving  mineral 
manures  only,  but  alternating  with  the  ammonium  series  which  receives  a  double  dress- 
ing of  ammonium  salts.     Fig.  5  shows  plot  6. 

The  similarity  of  the  curves  for  these  two  plots  is  striking,  and  requires  that  the 
interpretation  put  on  the  yields  of  the  alternating  plot  should  be  reconsidered.  Plot  G 
has  a  mean  yield  of  22  •  58  bushels  to  the  acre,  and  shows  a  relatively  rapid  deterioration  ; 
17  and  1 8  minerals  has  a  mean  yield  14  •  51 ,  scarcely  more  than  that  of  plot  5  which  receives 
the  same  di"essing  ;  its  average  deterioration  also  resembles  plot  5.  Hence  it  has  been 
thought  that  no  appreciable  benefit  accrued  to  the  alternating  series  from  the  previous 
year's  dressing  of  ammonium  salts.  It  has  however  been  observed  (5,  1921)  that  the 
alternating  series  is  much  more  variable  in  yield  than  is  plot  5,  and  this  suggests  that  the 
additional  variation  is  due  to  variable  residue  of  nitrogenous  material.  This  suggestion 
is  strongly  confirmed  by  the  resemblance  to  plot  6,  especially  in  the  effects  of  winter 
rainfall. 

Plot  6  has  a  strongly  marked  minimum  of  damage  in  early  October,  when  the  loss  is 
only  0-10  bushel  per  inch  of  rain,  or  0-4  per  cent.  ;  this  is  followed  by  a  winter  period 
in  which  rain  is  on  the  average  particularly  harmful  to  the  extent  of  1-14  (5-0  jjer 
cent.);  from  April  to  July  the  average  damage  is  nearly  constant  at  about  0-64 
(2-8  per  cent.). 
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Fig.  5. 


The  effects  on  17  and  18  minerals  are,  on  the  whole,  smaller,  being  approximately  pro- 
portional to  the  lower  yield.  A  minimum  of  damage  occurs  in  October  0  •  10  (0  •  7  per  cent.), 
followed  by  a  winter  maximum  0-64  (4-4  per  cent.),  and  by  a  nearly  stationary  period 
from  April  to  July  at  0-42  (2-9  per  cent.).  The  winter  effect  is  smaller  both  absolutely 
and  relatively  than  that  of  plot  6,  but  if  it  be  admitted  that  the  importance  of  winter 
rain  lies  largely  in  the  washing  out  of  nitrates,  the  evidence  for  considerable,  though  no 
doubt  variable,  residues  in  the  alternating  series  is  unmistakable.  The  fact  that  the 
alternating  series  has  not  given  an  appreciable  higher  yield  than  that  of  plot  5,  must  be 
ascribed  to  soil  heterogeneity,  which  has  approximately  balanced  the  advantage  of  the 
residual  nitrogen.  That  such  soil  heterogeneity  exists  in  this  field  may  be  shown  by 
a  comparison  of  2a  with  the  adjacent  plot  26.  These  have  received  the  same  manure 
since  1885,  but  during  the  present  century  2a  has  ceased  to  gain  in  yield  upon  26,  but  yields 
very  regularly  on  the  average  2  bushels  less  than  the  adjacent  plot.  In  view  of  this 
fact  we  cannot  deny  the  possibility  that  with  equal  manuring  the  land  of  plots  17  and  18 
would  yield  2  or  more  bushels  less  than  that  of  plot  5,  and  that  this  circumstance  has 
served  to  mask  the  advantage  accruing  to  the  mineral  series  owing  to  residual  nitrogen. 
It  may  be  mentioned  in  addition  that  the  mean  yield  of  17  and  18  ammonium  series  is 
2  •  36  less  than  that  of  plot  7,  a  difference  which  may  be  ascribed  to  the  combined  effect 
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of  soil  inferiority  anil  luclc  of  residual  nitrogen  ;  it  should  be  borne  in  niind  tliat  a  differ- 
ence of  this  amount  in  a  yield  of  30  bushels  probably  would  require  the  addition  of  con- 
siderably more  nitrogen  than  an  equal  increase  in  a  yield  of  14  bushels. 

Type  III. — Plot  7,  complete  minerals,  with  double  dressing  of  ammonium  salts,  of 
which  one  quarter  is  applied  in  autunm. 

Plot  13,  as  plot  7  without  sulphates  of  sodium  and  magnesium. 

Plot  17  and  18  ammonium  series. 
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These  three  plots  receive  similar  manurial  treatment;  the  mean  yields  are  31-37, 
30-21  and  29-01  bushels  per  acre,  with  mean  annual  decrements  0-144,  0-123  and  0-114. 
It  might  have  been  anticipated  that  the  decrement  should  be  least  on  the  alternating  plot  ; 
for  the  rest  it  is  not  obvious  to  what  circumstances  the  small  difference  between  the  plots 
are  to  be  ascribed. 

The  three  rainfall  curves  are  generally  similar,  all  indicate  a  slight  benefit  for  something 
over  a  month  in  autumn,  the  maxima  being  -f  0-14  (0-4  per  cent.),  +0-43  (1-4  per 
cent.),  and  +0-41  (1-4  per  cent.)  ;  the  winter  damage  is  strongly  marked  with  values 
1-47  (4-7  per  cent.),  1  -82  (6-0  per  cent.)  and  1  -85  (6-4  per  cent.).  There  is  in  all  cases 
a  clear  period  of  minimum  damage  in  May,  the  -values  falling  to  0-34  (1-1  per  cent.), 
0-34  (1-1  per  cent.)  and  0-20  (0-7  per  cent.),  followed  by  a  second  period  of  maximum 
damage  in  July  which  is  however  less  severe  than  the  winter  maximum,  the  values  are 

VOL.  CCXIII. — B.  T 


130 


MR.  R.  A.   FISHER   ON   THE   INFLUENCE   OF   RAINFALL 


0-89  (2-8  per  cent.),  0-79  (2-6  per  cent.)  and  0-94  (3-2  per  cent.).  On  the  average  plot 
13  suifers  somewhat  more  heavily  from  rain  than  do  plots  7  and  the  ammonium  series,  but 
in  consideriiio-  the  effects  at  different  times  of  the  year  it  is  seen  to  agree  wth  the  latter 
closely  during  the  first  half  of  the  year,  and  more  nearly  \vith  plot  7  during  the  second  half. 

Type  IV. — Plot  8  as  plot  7  with  additional  ammonium  salts  appHed  in  the  spring. 

Plot  12  as  plot  13  with  substitution  of  sodium  for  potassium  sulphate. 

Plot  14  as  plot  13  with  substitution  of  magnesium  for  potassium  sidphate. 

Fig.  7  shows  plot  8. 
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Fig.  7. 

In  most  respects  plot  8  contrasts  strongly  with  plots  12  and  14,  which  however  it 
resembles  in  its  response  to  rainfall.  The  mean  yield  is  35-09  against  28-32  and  27-77. 
while  it  shows  but  slight  deterioration,  0-U92  annually,  as  against  the  heavy  deterioration. 
0-181  of  plot  12,  and  0-231  of  plot  14.  The  resemblance  consists  iri  the  greater  relative 
abundance  of  nitrogen  supplied  to  these  plots  than  to  those  in  Type  III,  and  the  corre- 
spondingly less  relative  abundance  of  nitrogen  than  those  in  Type  V.  In  tlie  case  of 
plot  8,  although  the  yield  is  greater  than  that  of  plot  7,  yet  since  it  receives  a  triple  in  place 
of  a  double  dressing  of  ammonium  saltj^,  it  is  clear  that  this  plot  must  less  fi-equently 
suffer  from  lack  of  nitrates.  Plots  12  and  14  on  the  other  hand  receive  the  same  nitro- 
genous dressing  as  plot  7,  but  owing  to  lack  of  potash  yield  some  3  bushels  an  acre  less  ; 
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consequently  with  them  also  crop  limitation  through  lack  of  nitrates  nuist  be  less  frequent. 
The  same  argmnent  applies  still  more  strongly  to  plots  10  and  U  in  the  next  type. 

The  benefit  of  October  rain  is  relatively  large  in  plots  8  and  14,  and  plot  12  also  reaches 
2)ositive  values  at  this  period,  the  three  values  are  +0-29  (0-8  per  cent.),  +0-10  (0-3 
per  cent.)  and  +0-36  (1-3  per  cent.)  ;  the  winter  damage  is  strongly  marked,  though 
on  the  whole  less  so  than  in  the  preceding  group,  1  •  53  (4 •  3  per  cent.),  1  •  65  (5 •  8  per  cent.), 
1-52  (5-5  per  cent.).  The  chief  contrast  hes  in  the  deepening  of  the  second  period  of 
maxinmm  damage  ;  in  consequence  of  this  the  time  of  minimum  summer  damage  is 
shifted  back  to  the  begiiming  of  May  with  values  0  •  30  (0  •  8  per  cent.),  0  •  49  (1  •  7  per  cent.) 
and  0-23  (0-8  per  cent.)  ;  while  the  second  maxinmm  reaches  in  late  July  the  values 
1  -27  (3-5  per  cent.),  1-48  (5-2  per  cent.),  1  -23  (4-4  per  cent.),  showing  that  dry  weather 
at  this  period  is  to  these  plots  nearly  as  beneficial  as  in  January. 
Type  V. — Plot  10  double  dressing  ammonium  salts  only. 

Plot   11  double  dressing  ammonium  salts  with  superphosphate.      Fig.  8 
shows  plots  10  and  2b. 
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In  these  two  plots  the  yield  is  very  seriously  lowered  by  lack  of  potash,  and  in  the 
case  of  plot  10  by  lack  of  phosphate  also.  The  mean  yields  are  19-50  and  22-05,  with. 
heavy  annual  deterioration  0-157  and  0-219.  Plot  10  with  the  lower  yield  is,  as  might 
be  expected,  the  more  extreme  example  of  the  type. 

Both  ciu-ves  are  positive  in  October  with  values  +0-35  (1-8  per  cent.)  and  +0-16 
(0  -  7  per  cent.)  ;  the  winter  damage  is  less  marked  in  absolute  value  than  in  the  types 
III  and  IV.,  being  only  0-92  (4-7  per  cent.)  and  0-85  (3-8  per  cent.)  ;  in  April  positive 
values  are  reached  in  both  cases,  those  of  plot  10  being  even  higher  than  in  October, 
+  0-37  (1-9  per  cent.)  and  +  0-03  (0-1  per  cent.).  The  July  damage  is  more  ext.reme 
than  in  preceding  types  being  considerably  greater  than  that  in  January,  namely, 
1-67  (8-5  per  cent.)  and  1-43  (6-5  jjer  cent.).  In  both  the  April  and  the  July  values 
plot  10  contrasts  more  strongly  than  plot  1 1  with  the  plots  of  type  IV. 

Type  VI.^The  dunged  plot  2b  (fig.  8). 

This  plot  differs  £rom  all  the  others  in  the  textm-e  of  its  soil,  and  to  this  perhaps  may 
be  ascribed  the  absence  of  strong  contrasts  in  the  effects  of  rain  at  different  seasons  ; 
the  average  advantage  of  dry  Weather  is  comparable  with  the  other  plots,  but  the  only 
marked  feature  of  the  curve  is  the  severe  damage  caused  by  rain  immediately  preceding 
harvest.  This  feature  may  perhaps  be  analogous  to  the  heavy  damage  of  July  rain 
in  plots  10  and  11,  which  it  should  be  noted  resemble  the  dunged  plot  in  their  relative 
abundance  of  nitrogen.  Plot  2b  differs  strikingly  from  all  the  other  plots,  and  especially 
from  plots  10  and  11  in  the  relative  constancy  of  its  yield. 

The  mean  yield  is  34-55  bushels  per  acre.  The  autunm  minimum  of  rain  drainage  is 
0-43  (1-3  per  cent.),  the  winter  maximum  1  -00  (2-9  per  cent.),  the  summer  minimum  is 
well  marked  and  prolonged  at  0-24  (0-7  per  cent.),  after  which  the  curve  falls  to  a  final 
value  of  1-39  (4-0  per  cent.)  ;  as  we  have  seen,  the  mean  damage  for  harvest  rain  which 
catches  the  crop  is  no  doubt  greater. 


10.  The  Value  of  Rainfall  Regressions  as  Prediction  Formula. 

The  extent  to  which  the  variation  in  crop  yield  is  predicable  from  a  rainfall  record  may 
be  calculated  from  the  coefficients  of  multiiile  regression  of  the  last  section.  As  we  have 
seen  in  section  2  the  coefficient  of  nudtiple  correlation  gives  a  much  exaggerated  notion 
of  the  prediction  value  of  a  regression  formula,  if  calculated  fiom  a  small  sample.  In 
the  present  instance,  although  our  series  of  values  is  a  long  one,  the  number  of  degrees 
of  freedom  has  been  whittled  down  to  54  by  the  rejection  of  misuitable  years,  and  the 
elimination  of  slow  changes  ;  six  meteorological  variates  have  been  used,  which  represents 
the  utmost  economy  in  view  of  the  complexity  of  the  meteorological  sequences.  The 
distribution  of  the  variance,  between  the  6  degrees  of  freedom  of  the  regression  formula, 
and  the  48  degrees  of  freedom  in  which  the  variates  may  differ  from  the  regression  fornmla 
is  shown  in  Table  XIV,  in  which  are  also  shown  the  multiple  correlation  R,  and  the 
percentage  of  variance  (A,  of  section  2)  ascribable  to  the  average  effect  of  rain. 
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Plot. 

Sum  of  squares. 

Total 
54  degrees. 

R. 

per  ceut. 

Regression 

formula 

6  degrees. 

Deviations 
48  degrees. 

2b 

3  &4 

5 

6 

7 

8 
10 
11 
12 
13 
14 

17  &  18  M. 
17  &  18  A. 

884 
149 
163 
563 
790 
959 
796 
668 
1,395 
985 
938 
199 
965 

996 

336 

401 

1,062 

2,091 

2,083 

1,827 

2,501 

2,069 

1,938 

2,069 

686 

1,189 

1,880 

485 

564 

1,625 

2,881 

3,042 

2,623 

3,169 

3,464 

2,923 

3,007 

885 

2,154 

0-6859 
0-5548 
0-5370 
0-.5885 
0-5236 
0-5614 
0-.5509 
0-4592 
0-6.345 
0-5803 
0-5584 
0-4736 
0-6693 

40-42 
22-13 
19-93 
26-46 
18-34 
22-96 
21-63 
11-23 
32-79 
25-39 
22-58 
12-73 
37-90 

It  mil  be  seen  how  very  inadequate  is  the  vahie  of  R  to  indicate  the  vahie  of  prediction 
formula  ;  the  extreme  vahies  of  R  in  the  above  totals  are  0-459  and  0-686,  but  these 
values  indicate  in  this  case  that  in  one  plot  11  per  cent.,  and  in  the  other  40  per  cent,  of 
the  variance,  is  expressible  in  terms  of  the  sequence  of  rain  records. 

It  is  remarkable  that  so  much  of  the  variance  as  40  per  cent,  should  be  expressible  in 
terms  of  a  single  meteorological  element  such  as  rainfall,  especially  when  it  is  remembered 
that  all  causes  of  variation  without  exception,  including  casual  errors,  and  the  quadratic 
terms  of  the  rainfall  effect,  are  included  in  the  remaining  60  per  cent.  This  leads  us  to 
think  that  a  record  of  rainfall,  in  spite  of  the  many  disabilities  which  have  been  urged 
against  it,  is  of  more  value  than  the  record  of  any  other  single  element,  in  characterising 
the  season. 

The  effects  ascribed  to  rain  are  in  most  plots  clearly  significant ;    the  values  of  P, 

calculated  from  the  formula 

(1  -  Wf^  (1  +  24R=  +  300R^) 

range  from  U- 00001 86  to  0-0659. 

The  excessive  variation  of  plot  11  thus  masks  the  rainfall  effect  in  the  same  maimer 
that  it  masks  the  slow  changes  in  this  plot ;  the  rainfall  effect  is  somewhat  the  more 
important  and  shows  up  more  clearly.  In  both  cases  the  similarity  of  the  curves  of  plot  1 1 
to  those  of  other  plots,  especially  those  which  it  would  be  expected  most  nearly  to  resemble, 
shows  that  no  serious  deviations  have  been  introduced  into  these  by  random  fluctuations. 

The  probable  errors  of  random  samphng  of  the  regressions  of  crop  on  rainfall  may  be 
calculated  as  demonstrated  by  the  author  in  1922  (9).  The  number  of  years  is  sufficient 
to  ensure  the  effective  normality  of  the  distributions.  In  the  comparison  of  the  regi'ession, 
of  any  two  plots,  we  may  anticipate  that  the  random  errors  are  on  a  substantially  smaller 
scale,  since  the  experience  of  all  the  plots  is  drawn  from  an  identical  series  of  seasons. 
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No  statistical  estimate  of  the  accuracy  of  curves  for  purposes  of  comparison  can  be  made 
in  the  absence  of  strictly  parallel  plots.  It  may,  however,  be  confidently  anticipated 
that  random  errors  of  this  kind  are,  if  anything,  of  a  smaller  order  than  the  differences 
between  the  curves  obtained  for  plots  7,  13  and  (17  and  18)  ammonium  series. 

Reality  of  Sloiv  Changes. 

The  prediction  formidse  which  we  have  obtained  are  entirely  independent  of  the  slow 
changes  which  appear  to  have  taken  place  in  yield  and  in  weather.  In  presenting  the 
evidence  for  slow  changes  in  yield  (5)  it  was  presumed,  in  the  absence  of  a  full  investigation 
of  the  sequence  of  rainfall  records,  that  favourable  and  unfavourable  weather  conditions 
fluctuated  independently  from  year  to  year  ;  in  fact,  that  slow  changes  were  absent  from 
the  meteorological  series.  We  now  know  that  a  small  percentage  of  the  variance  of 
annual  rainfall  should  probably  be  ascribed  to  slow  changes,  and  that  sequences  of  wet 
years  did  in  fact  occur  about  1879  and  1914.  Since  these  two  periods  agree  with  the 
two  main  depressions  in  yield,  it  becomes  at  first  sight  questionable  whether  the  slow 
changes  in  yield  may  not  after  all  be  ascribable  to  meteorological  effects. 

This  question  is  most  readily  answered  by  calculating  the  actual  depression  in  yield 
ascribable  to  the  additional  rainfall  in  the  two  rainy  periods.'  Plot  2b  is  most  suitable 
for  a  comparison  for  the  residual  variance  is,  in  comparison  to  the  yield,  very  small 
in  this  plot,  and  the  prediction  formula  correspondingly  accurate.  The  yield  in  any 
year  is  regarded  as  made  of  two  parts,  part  (positive  or  negative)  is  due  to  the  rainfall 
sequence  under  which  the  crop  was  grown,  the  remainder  is  the  yield  handicapped  for 
the  advantages  or  disadvantages  ascribable  to  rainfall.  The  sequence  of  10-year  means 
of  these  two  quantities  is  shown  in  fig.  9.  It  will  be  seen  that  the  yield,  after  maldng 
allowance  for  rainfall,  still  shows  strongly  the  slow  changes  which  originally  attracted 
attention.  The  changes  in  weather  only  account  directly  for  a  portion  of  the  slow 
changes  observed  ;  in  the  first  half  of  the  sequence  this  portion  is  roughly  one-half, 
but  in  the  latter  half  it  would  appear  not  to  exceed  one-quarter  of  the  total  effect. 

The  circmustancc  that  in  two  cases  the  depression  of  yield,  after  allowing  for  rainfall, 
coincides  with  a  series  of  years  in  which  the  rainfall  was  on  the  average  unfavoimible, 
is  in  accordance  with  the  suggestion  previously  i>ut  forward,  that  in  addition  to  the 
immediate  effect  of  rainy  weather  on  the  growing  crop,  a  sequence  of  wet  years  produced 
an  a<l(liti()nal  and  prolonged  unfavourable  influence  by  fostering  weeds.  Two  cases  of 
agreement  are,  liowever,  quite  insufficient  to  prove  that  the  additional  depression  of 
yields  is  causally  connected  with  the  rainy  periods.  The  fact  of  the  coincidence,  whether 
causal  or  fortuitous,  does,  however,  emphasise  the  importance  of  eliminating  slow  changes 
in  the  study  of  annual  figures  ;  if  slow  changes  had  been  ignored  in  carrying  through 
the  correlational  work,  we  should  no  doubt  have  arrived  at  greatly  exaggerated  estimates 
of  the  liarniful  effect  of  rain.  Higher  correlations  would  have  been  obtained  and  the 
results  would  have  exhibited  one  more  case  of  high  but  essentially  unreliable  correlations 
in  meteorological  agricidture.  Against  the  view  that  the  depression  in  yields  is  an 
indirect  effect  of  a  rainy  period  nmst  be  set  the  fact  that  in  our  diagrams  the  yield  appears 
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to  fall  off  somewhat  before  the  wet  sequence  has  set  in  ;  both  series  being  10-year  means  the 
slight  falsification  of  the  sequence  incurred  by  using  this  form  of  representation  should 
have  a  similar  effect  in  both  series. 

Effect  of  Fallotv. 

Certain  years  were  rejected  o^ving  to  the  crops  grown  having  followed  a  fallow  ;  it  was 
suspected  that  the  yields  obtained  in  these  years  were  undvdy  high,  and  would  vitiate 
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Fig.  9.— Graph  A  shows  the  variation  in  the  expectation  of  wheat  yield  as  judged  from  the  quantity  and 
date  of  the  rain  during  the  year.  Graph  B  shows  the  additional  variation  of  the  actual  yields 
observed  after  allowance  for  the  expected  rainfall  effect.  Both  graphs  show  10-year  means,  in 
bushels  per  acre. 

the  weather  correlations  if  they  were  retained.  A  knowledge  of  the  expected  effect 
of  weather,  together  with  that  of  the  general  fertility  of  the  plot  in  neighbouring  years, 
makes  it  possible  to  estimate  the  amount  by  which  the  fallow  benefited  the  crop.  On 
plot  2b  the  excess  of  yield  over  expectation  in  1890,  18.91,  was  4  •  48,  6  •  77  ;  in  1905,  190G, 
the  excess  values  were  6-98,  5-95,  while  in  1915  the  excess  was  nearly  15-50.  In  191G 
the  two  halves  of  the  field  were  harvested  separately,  and  the  half  that  followed  fallow 
showed  an  excess  of  14-30  bushels  over  the  other.  This  suggests  that  the  value  for  J915 
is  not  so  inaccurate  as  might  be  thought. 

In  all  these  6  cases  the  effect  of  fallow  is  strongly  marked  ;  the  average  gain  is  the 
very  large  amount  of  9  bushels  per  acre.  It  shoidd  be  remembered  that  this  plot  receives 
a  heavy  dressing  of  farmyard  manure,  and  it  is  doubtful  if  any  part  of  the  gain  can  be 
ascribed  to  the  accumulation  of  plant  nutrients.  The  immediate  object  of  fallowing  was 
in  all  cases  the  eradication  of  weeds,  aiid  it  may  be  that  the  increased  yield  was  mainly 
due  to  a  greater  freedom  of  the  land  from  weed  infestation.     If  that  is  so  the  effect  must 
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have  been  remarkably  transient,  for  the  vahie  for  1916  depends  on  a  comparison  of  the 
crop  immediately  following  fallow  with  one  which  had  been  fallowed  only  two  years 
before.  The  supposition  that  the  benefit  was  largely  a  matter  of  freedom  from  weeds 
accords  with  the  higher  values  of  1915,  1916,  when  the  field,  to  judge  by  the  average 
yields  on  all  plots,  was  much  more  severely  infested  than  in  1890  or  1905. 

The  magnitude  of  the  eiiect  of  fallow  illustrates  the  complications  which  arise  in  the 
interpretation  and  statistical  reduction  of  even  the  best  agricultural  data.  We  have 
attempted  to  ehminate  the  large  changes  which  have  occurred  in  the  yields  of  these  plots 
by  means  of  continuous  curves,  but  there  is  reason  to  fear  that  even  if  the  fertility  of 
the  soil  normally  suffers  continuous  changes  from  year  to  year,  yet  discontinuities  were 
introduced  at  the  periods  when  fallowing  was  resorted  to.  By  rejecting  the  crops 
immediately  concerned  the  greater  part  of  the  errors  have  no  doubt  been  eliminated, 
yet  the  theoretical  efficacy  of  the  continuous  curve  is  im]iaired,  however  nearly  it  appears 
in  practice  to  represent  the  actual  com'se  of  events. 

11.  Comparison  with  Previous  Results. 

In  1880  La  WES  and  Gilbert  published  (12)  a  long  and  careful  account  of  the  meteoro- 
logical characteristics  of  seasons  favourable  and  unfavourable  to  wheat.  The  object 
was  to  effect  a  qualitative  and  preliminary  enquiry  (p.  174)  :  "  As  yet,  however,  the 
connection  between  meteorological  phenomena  and  the  progress  of  vegetation  is  not  so 
clearly  comprehended  as  to  enable  us  to  estimate  Avith  any  accuracy  the  yield  of  a  crop 

by  studying  the  statistics  of  the  weather  during  the  period  of  its  growth 

But  it  is  only  by  a  careful  comparison  of  the  characters  of  the  seasons  on  the  one  hand, 
and  of  the  quantity  and  quality  of  the  produce  on  the  other,  for  many  years,  that  we 
can  hope  to  acquire  sufficient  knowledge  to  enable  us  to  assign  to  the  various  agencies, 
the  sum  of  which  constitutes  the  climate  of  the  year,  their  respective  values  in  the  jiro- 
duction  of  the  crop." 

In  respect  of  rainfall,  Lawes  and  G-ilbert,  using  principally  Eothamsted  data,  but 
also  some  instances  of  years  of  exceptional  harvests  prior  to  their  experiments,  concluded 
that  comparative  dryness  was  desirable  from  Seed-time  (November)  to  harvest,  especially 
in  Winter  and  early  Spring.  Their  table  of  averages  shows  no  exceptional  dryness  of 
October  for  the  favourable  years,  or  exceptional  wetness  in  the  unfavourable  years. 
The  effect  of  winter  and  spring  rain  is  ascribed  partly  to  drainage  causing  loss  of  nitrates, 
and  partly  to  hindering  root  development. 

It  appears  that  in  the  points  upon  which  they  laid  most  stress  the  conclusions  arrived 
at  by  the  more  exact  statistical  methods  now  available  and  with  the  aid  of  39  more 
seasons'  experience  at  their  station,  would  have  caused  Lawes  and  Gilbert  only  to 
reaffirm  their  conclusions  more  strongly  and  with  gi'eater  precision.  The  comparison 
of  different  jjlots  has  emphasised  the  importance  of  the  drainage  of  nitrates,  and  points 
to  further  influence  of  manurial  conditions  on  the  response  of  the  crop  to  July  rain.  The 
cause  of  this  is  at  present  obscure,  but  the  effect  seems  well  marked.     In  addition  it  has 
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been  possible  to  make  some  attempt  to  develop  fonnuhe  winch  shall  predict  the  viekl 
from  the  weather  statistics. 

Attention  Avas  called  to  the  possibility  that  autumn  rainfall  was  an  important  factor 
in  determining  the  wheat  crop  by  Sir  Napier  Shaw  in  1905  (14).  Shaw  did  not  use  the 
method  of  correlation,  but  pointed  out  that  in  the  twenty  years  1885-1904.  with  two 
exceptions,  when  the  yield  for  Eastern  England  was  above  the  average  the  previous 
autmnn  rainfall  was  below  the  average  and  vice  versa.  In  this  set  of  20  jjairs  of  values 
the  correlation  is  actually  —  0  •  629,  a  value  which  would  only  be  exceeded  bv  uncorre- 
lated  variates  in  3  samples  of  20  out  of  a  thousand.  The  particular  meteorological  variate. 
autimm  rainfall,  was  picked  out  of  a  table  giving  at  least  3G  meteorological  quantities, 
and  the  chance  that  all  of  these,  if  independent  and  in  reality  uncorrelated  with  the  crop, 
shoidd  give  correlations  between  ±0-629,  was  therefore  about  0-89.  Such  a  system 
woidd  therefore  be  exjjected  to  yield  so  high  a  correlation  only  once  in  9  trials,  and  the 
fact  that  such  a  correlation  occurred  supplied  some  presumption  that  autumn  rainfall 
had  in  reality  a  perceptible  influence  on  the  crop. 

In  discussing  the  significance  of  this  residt  Shaw  (14.  pp.  318-319)  made  the  inter- 
esting suggestion  that  the  association  observed  might  not  be  due  wholly  to  the  effects 
of  autumn  weather,  but  possibly  to  dry  autunms  being  frequently  followed  by  a  favourable 
succession  of  weather  in  Spring  and  Summer. 

'■  He  woidd  like  to  say  a  word  with  resjiect  of  Ma".  Thomas's  suggestion  that  the 
autmnn  rainfall  was  not  the  dominant  factor  in  determining  the  subsequent  yield 
of  wheat.     What  surprised  him  were  not  the  exceptions,   but  the  agreements. 
Remembering  that  nine  months  had  to  run  between  the  end  of  autunui  and  tlie 
beginning  of  harvest,  and  considering  the  influence  of  the  intervening  rainfall,  sun- 
shine and  other  accidents  that  might  happen  to  the  crop  before  it  was  gatherefl, 
it  was  surprising  that  the  connection  should  be  so  close  as  to  be  expressed  possibly 
numerically.     It  might  be  true  that  two  other  columns  of  figures  might  be  tabidated 
wliich  would  show  a  closer  agreement  than  the  two  columns  put  down  in  the  table, 
but  they  certainly  would  not  be  colunms  of  figures  for  individual  elements.     To 
take  two  elements  out  of  the  whole  table,  put  them  side  by  side,  and  find  them  to 
agree  as  they  did  in  this  case,  was  astonishing.     Of  course  other  influences  affected 
wheat,  and  it  might  be  that  a  sequence  of  influences  was  reqiured  to  follow  the  autumn 
rainfall  in  order  to  bring  out  the  corresponding  result.     It  might  be  that  the  relation 
was  a  meteorological  one,  and  that  a  dry  au.tumn  itself  implied  a  dry  spring  or  a  dr>' 
summer,  or  whatever  combination  of  circvunstances  was  required  for  a  good  yield. 
He  could  give  a  certain  amount  of  evidence  in  favour  of  the  contention  that  a  meteoro- 
logical relation  existed,  and  that  it  was  not  what  took  place  in  the  autumn  alone  which 
might  accoimt  for  the  relation,  but  succeeding  events  as  well,  which  were  associated 
with  a  dry  autunm.     That  contingency  made  the  subject  one  of  considerable  interest, 
and  one  which  must  be  pursued  ratlier  more  fully  than  was  possible  on  the  present 
occasion." 
VOL.  ccxiii. — B.  r 


138 


MK.  K.  A.  FISHER  ON   THE  INFLUENCE  OF   KAINFALL 


A  great  advance  in  method  is  shown  by  the  magnificent  paper  by  Hooker  (10)  in  1907. 
Hooker  systematically  correlated  rain  and  accumulated  temperature  for  8-week  periods 
throughout  the  year  with  the  yields  of  a  number  of  farm  crops  in  Eastern  England,  using 
a  slightly  different  area  from  that  used  by  Shaw.  Hooker  clearly  recognised  the  two 
limitations  from  which  the  method  of  correlation  suffers,  namely  that  it  takes  account  only 
of  the  linear  relation  of  the  variates,  and  secondly  that  the  correlations  obtained  will 
be  much  affected  by  any  such  meteorological  associations  between  the  weather  at  different 
times  of  the  year  as  were  suggested  by  Shaw.  It  was  to  remove  these  two  limitations 
from  the  treatment  of  the  Rothamsted  data  that  the  present  method  of  computing 
the  partial  regressions  of  the  yield  on  the  weather  at  each  period  of  the  year  was  devised, 
a  method  which  as  we  have  seen  can  be  extended  to  the  discussion  of  the  quadratic  terms 
of  the  regression  function. 

Hooker  found  high  negative  correlations  between  the  wheat  yield  and  the  rainfall 
for  periods  centred  in  October  and  January,  while  between  them  small  positive  correla- 
tions occurred.  In  May  again  positive  correlations,  this  time  somewhat  larger,  made  their 
appearance.  It  is  concluded  that  a  dry  September-October  ranks  first  among  the  wheat's 
requirements,  while  the  effect  of  winter  rain  is  ascribed  partially  to  the  high  correlation 
(+0-6)  found  between  the  rainfall  of  weeks  1-8  and  that  of  the  preceding  weeks  37-44. 

In  1922  Hooker  (11)  gave  recalculated  figures  for  the  same  variates  ;  the  results  being 
now  based  on  3.5  years,  possess  considerable  significance.     Diagxam  10  sh<\vstlie  correla- 
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Fig.    10.— Rainfall  corrdatioii.s  obtained  by  Hooker  from  uttieial  rt'tiinis  of  wheat  yields  for  Eastern 
England.  u-iin<^  rainfall  for  8-\veek  periods  centred  at  the  dates  .shown. 
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tions  obtained  for  successive  (overlapping)  8-week  periods,  in  respect  of  which  Hooker 
draws  the  following  conclusions  (p.  120)  : — 

"  Looking  first  at  the  ciu-ve  showing  the  connection  between  rain  and  wheat, 
the  most  striking  features  are  the  dips  about  October  and  in  winter,  both  very  nearly 
identical.  As  a  matter  of  fact  the  winter  dip  is  slightly  greater  than  the  other. 
But  no  weight  can  be  attached  to  so  slight  a  difference,  and  we  must  apparently  regard 
a  dry  seed-time  and  a  dry  subsequent  winter  as  about  of  equal  importance.  Sir 
Napier  Shaw  appears  to  have  been  the  first  to  call  attention  to  the  gi-eat  advantage 
of  a  dry  autumn,  and  my  calculations  fifteen  years  ago  pointed  to  this  period  being 
more  important  than  the  winter.  It  is  not  very  clear  from  the  writings  of  Gilbert 
and  Lawes  how  far  they  realised  this  feature  of  the  wheat's  requirements  ;  it  would 
seem  that  they  were  aware  of  it,  but  they  lay  all  stress  upon  the  winter  rainfall  and 
the  washing  of  the  nitrates  out  of  the  soil.  Wheat  is  a  deep-rooting  plant,  and  I 
believe  it  to  be  probable  that,  if  experiments  could  be  devised  to  test  the  point, 
it  would  be  found  that  the  real  effect  of  a  very  wet  autunm  is  mechanical — that 
it  is  then  more  the  clogging  of  the  soil  that  prevents  the  proper  development  of  the 
root  system  than  the  absence  of  sufficient  nitrates.  The  effect  of  the  loss  of  the 
latter  is  probably  more  felt  later  on,  during  the  winter,  after  the  plant  is  established, 
and  this  is  reflected  in  the  heavy  negative  coefficient  with  rain  during  the  winter 
months.  Gilbert  and  Lawes  have  laid  so  much  emphasis  upon  the  washing  out 
of  the  nitrates  that  it  is  generally  overlooked  that  they  did  in  fact  recognise  the 
hindrance  to  root  development  caused  by  saturated  soil.  It  seems  reasonable  to 
conclude  from  the  data  that  the  mechanical  effect  of  the  autumn  is  just  as  important 
as  the  chemical  effect  of  the  winter  ;  and  it  is,  moreover,  one  that  is  practically 
irremediable." 

The  statement  that  the  winter  dip  is  slightly  gi'eater  refers  to  the  partial  correlations 
obtained  after  ehminating  the  effects  of  temperatm'e  ;  we  are  here  only  concerned  with  the 
total  effects,  and  it  is  apparent  that  Hooker's  data  contain  substantial  evidence  for  a 
real  deleterious  effect  of  Autumn  rainfall.  In  other  respects  the  series  of  correlations  is 
not  out  of  harmony  with  several  of  the  Broadbalk  plots,  the  characteristic  period  of 
winter  damage  is  clearly  recognisable  in  both,  and  we  may  identify  the  positive  corre- 
lations obtained  in  late  Spring  with  the  minimum  of  damage  observed  on  several  of 
our  plots  at  the  same  season.  The  July  values  also  are  suggestive  of  the  more  highly 
nitrogenous  plots  on  Broadbalk.  The  fact  that  Hooker's  series  appears  to  incline  on 
the  whole  to  more  positive  values  than  are  observed  in  our  regressions  is  not  improbably 
due  to  the  inclusion  in  his  data  of  yields  £rom  lighter  lands,  more  susceptible  to  sununer 
droughts  than  is  the  heavy  loam  of  Broadbalk  ;  in  addition  his  mean  rainfall  is  some 
four  inches  less  than  in  our  series. 

The  earlier  half  of  the  series,  in  which  the  Broadbalk  plots  are  most  alike,  is  alone  in 
contradiction  to  Hooker's  series.  Of  course,  if  strong  meteorological  correlations 
existed  between  the  several  parts  of  the  year,  no  resemblance  need  be  expected  between 
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the  actual  regressions  and  the  correlations  which  are,  in  a  complicated  manner,  com- 
poiuided  fi-om  them  ;  and  even  slight  meteorological  correlations  might  seriously  disturb 
the  numerical  values  of  the  correlations  with  the  crop  ;  but  our  meteorological  data 
show  that  such  correlations  between  weather  at  different  periods  of  the  year,  though 
probably  present,  are  quite  small ;  and  since  the  agreement,  in  its  general  qualitative 
outline,  has  not  been  disturbed  from  January  onwards,  it  is  probable  that  other  factors 
contribute  to  the  striking  difference  observable  in  the  effects  of  October  rain.  In 
particular  it  may  be  noted  that  Hooker's  values  at  this  period,  allowng  for  the  fact  that 
adjacent  points  are  not  independent,  but  have  four  identical  weeks  in  common,  show 
somewhat  abrupt  changes  ;  the  values  found  for  November-jDecember  agree  well  with 
oiu'  maximmn  in  October,  and  the  damage  ascribed  by  Hooker  to  "  seed-time  "  may 
be  paralleled  by  the  somewhat  increased  damage  shown  in  many  of  oux  curves  in 
September.  In  this  connection  an  investigation  is  desirable  of  the  accuracy  with  which 
Hooker's  process  of  correlating  with  sets  of  8-consecutive  weeks,  is  able  to  indicate 
the  true  maxima  or  minima  of  the  correlation  curve  ;  and  to  what  errors  such  estimates 
are  exposed  owing  to  the  capricious  incidence  of  heavy  rain. 

A  consideration  which  more  probably  contains  a  solution  of  the  discrepancy  is  that 
under  industrial  farming  conditions  rain  in  late  autumn  and  early  winter  prevents  the 
sowing  of  large  areas  of  wheat,  the  land  being  sown  in  spring  with  oats  or  barley.  This 
is  a  factor  which  must  strongly  influence  Hooker's  residts,  whereas  under  experimental 
farming  conditions  it  is  inoperative.  Hooker  mentions  that  the  wheat  area  has  a 
coixelation  —  0-41  with  rain  of  weeks  37-44,  but  does  not  mention  the  correlation  of  neigh- 
bouring periods  ;  these  it  would  be  necessary  to  know  in  tracing  out  the  influence  of  the 
variable  wheat  areas  on  the  yield. 

It  would  seem  probable  that  the  proportion  of  land  lost  to  wheat  in  this  way  should 
differ  from  district  to  district,  and  even  more  probable  that  such  loss  should  not  be  jjro- 
poi-tionately  distributed  between  the  two  main  classes  of  wheat  land  in  which  the  wheat 
follows  respectively  clover  and  roots.  There  are  thus  many  points  at  which  the  effect  of 
autuiim  rainfall  upon  cropping  might  give  rise  to  an  apparent  influence  on  yield. 

It  is  admittedly  by  no  means  necessary  that  the  response  of  crop  to  weather  should 
be  the  same  on  one  particular  type  of  land  as  it  is  o)i  the  average  wheat  land  of  several 
comitries  ;  the  ciuwe  of  regression  may  be  as  much  influenced  by  soil  type,  as  we  now 
know  it  to  be  by  the  manurial  condition  of  the  soil.  In  this  respect  data  of  crop  averages 
grown  under  industrial  conditions  would  be  of  the  greatest  value  in  supplementing  by 
shorter  series  over  a  more  extensive  area  the  long  series  from  a  single  field  which  the 
Broadbalk  data  provide.  It  is  to  be  feared,  however,  that  the  Ministry  of  AgTicidture's 
returns  are  not  sufficiently  accurate  to  meet  this  need,  being  based  principally  uj)on 
eye  estimates  of  yield  before  harvest.  Such  estimates  may  be  closely  correlated  with 
the  true  yields,  and  yet  fail  to  give  the  values  of  the  required  regressions  even  with 
approximate  accuracy  ;  for  there  is  reason  to  fear  that  the  deviations  from  the  known 
mean  yield   are  systematically,  though  of  course  unconsciously,  underestimated.     In 
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inquiries  respecting  the  weather  it  would  be  inipoitant  to  ascertain  also  that  the  known 
meteorological  peculiarities  of  the  year  under  review,  the  supposed  effects  of  which 
upon  the  crops  may  have  been  discussed  in  the  press,  are  without  systematic  influe}ice 
u]3on  the  judgments  of  repoi-ters. 

J  2.  SioiutHirij. 

The  study  of  the  theoretical  distributions  of  statistics  emphasises  the  dangers  of 
applying  the  methods  of  nuiltiple  correlation  to  small  samples,  and  the  necessity  of 
extensive  crop  data  in  the  study  of  meteorological  agriculture. 

By  a  special  procedm'e  involving  the  analysis  of  separate  meteorological  sequences  it 
is  possible  to  obtain  an  adequate  mathematical  expression  of  the  average  effects  of  the 
meteorological  influences  indicated  by  different  instrumental  observations  at  different 
times  of  the  year. 

The  errors  involved  in  the  correlation  of  residuals  of  series  changing  in  an  miknown 
manner  may  be  minimised  by  the  method  of  polynomial  fitting  ;  such  errors  are  probably 
insignificant  when  this  jKocedure  is  applied  to  rain  data  and  wheat  yields. 

The  rain  data  for  liothamsted  have  been  analysed  for  ()5  years  ;  there  are  some  indica- 
tions that  the  wet  years  tend  to  occur  in  spells  ;  a  continuous  and  progressive  change 
is  observable  in  the  distribution  of  rain  through  the  year  ;  in  other  respects  the  sequence 
appears  to  be  fortuitous.  Rainfall  changes  account  for  only  a  ])ortion  of  the  slow  changes 
observed  in  the  yields. 

Curves  showing  the  average  effect  on  the  yield,  for  each  additional  inch  of  rain,  thrijugh- 
out  the  year,  have  been  obtained  for  13  plots  of  Broadbalk  wheat  field,  which  have  been 
under  uniform  experimental  treatment  since  1852.  On  all  the  plots  dry  weathei-  is 
generally  beneficial.  A  detailed  comparison  of  the  several  plots  indicates  a  predominant 
influence  of  the  effect  of  rain  in  removing  soil  nitrates  ;  the  cause  of  other  well-marked 
featiu'es  cannot  safely  be  asserted  without  further  research,  which  it  is  hoped  may  be 
facilitated  by  the  body  of  facts  expressed  in  these  curves. 

Previous  investigations  bearing  particularly  on  the  present  data  are  briefly  discussed 
i}i  the  last  section. 
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IX.  The  Interaction  of  Clmj  loith.  Water  and  Oiyanir.  Liquids  a.s  Measui-ed  hj 
Specific  Volume  Changes  and  its  Relation  to  the  Vhenontena  of  (rund) 
For/nation  in  Soils. 

By  E.  W.  Russell,  M.A.,  Ph.D. 
{Soil  Physics  Department,  Rothamsted  Experimental  Station,  Harpenden.) 

{Communicated  by  Sir  John  Russell,  F.R.S.) 

(Eeceived  December  29,  1933.— Read  April  26,  1934.) 

The  densities  of  clay  particles  determined  by  the  specific  gravity  bottle  method  vary 
somewhat  according  to  the  liquid  used.  Thus  water,  alcohol,  and  benzene  all  give 
slightly  different  values.  The  results  show  that  some  condensation  of  the  liquid  on 
the  surface  of  the  particles  usually,  if  not  invariably,  takes  place,  indicating  a  certain 
amount  of  interaction  between  the  clay  and  the  liquid. 

The  investigation  described  in  this  paper  shows  that  this  interaction  depends  partly 
on  the  exchangeable  bases  associated  with  the  clay,  and  partly  on  the  molecular 
constitution  of  the  liquid.     The  relations  are  as  follows  : 

(1)  No  evidence  could  be  obtained  of  interaction  between  clay  and  non-polar  liquids. 

(2)  Interaction  took  place  in  all  the  polar  liquids  examined.  Its  amount  was  measured 
by  the  reduction  in  specific  volume  of  the  clay  as  compared  with  its  volume  in  tetralin, 
a  very  convenient  non-polar  liquid  for  specific  gravity  determinations.  This  reduction 
in  specific  volume  is  approximately  proportional  to  the  number  of  exchangeable  cations 
present,  figs.  10,  11,  and  to  the  mean  density  of  their  surface  charge.  Table  IV.  Thus 
the  order  of  effectiveness  of  the  cations  is  :  magnesium — most  active,  calcium,  hydrogen, 
sodium,  potassium — least  active.  This  order  is  independent  of  the  liquids  used  provided 
no  chemical  action  takes  place  between  the  exchangeable  cation  and  the  liquid. 

(3)  A  similar  effect  is  produced  by  some  other  property  of  the  clay  associated  with 
the  shape  of  the  titration  curve. 

(4)  The  amount  of  interaction  depends  on  the  liquid,  but  no  simple  relations  with 
the  molecular  constitution  have  yet  been  discovered. 

(5)  During  the  process  of  removing  the  liquid  by  drying,  the  clay  particles  may 
aggregate  to  form  definite  crumbs. 
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The  Phenomena  of  Cnimb  Formation 

Considerable  attention  has  been  paid  to  crumb  formation  because  of  its  importance 
in  natural  soils.  Good  tilth,  which  is  essential  for  successful  plant  growth,  is  largely- 
dependent  on  a  stable  crumb  structure  of  the  soU,  and  much  of  the  art  of  cultivation 
consists  in  the  production  of  this  structure  prior  to  the  sowing  of  the  seed.  Considerable 
success  has  been  attained  by  purely  empirical  methods,  but  hitherto  no  satisfactory 
explanation  of  the  mechanism  of  crumb  formation  has  been  offered. 

Crumb  particles,  similar  to  those  found  in  nature,  can  be  produced  artificially  in 
the  laboratory  by  mixing  clay  with  water  and  allowing  it  to  dry  vmder  definite  conditions. 
These  crumbs  are  characterized  by  two  distinct  properties  :  they  are  hard  when  dry 
and  they  possess  fairly  characteristic  shapes.  The  following  are  the  main  phenomena 
of  crumb  formation  and  stability  brought  out  in  the  present  investigations. 

(1 )  Crumb  formation  is  limited  to  those  particles  possessing  considerable  base  exchange 
capacity.  It  is  not  shown  by  any  of  the  kaolinitic  clays  of  low  base  exchange  capacity 
so  far  examined. 

(2)  Crumb  formation  is  further  limited  to  particles  below  a  certain  size.  All  the 
fine  clay  particles  formed  aggregates,  though  no  fine  clay  has  as  yet  been  found  with 
a  low  base  exchange  capacity.  Silt  particles  of  settling  velocity  exceeding  3-5  jx/sec. 
did  not  form  aggregates  however  large  their  base  exchange  capacity  might  be. 

The  coarser  clay  minerals  and  the  fractions  devoid  of  base  exchange  are  thus  not 
essential  for  aggregate  formation,  but  seem  to  serve  only  as  a  diluent,  keeping  the 
aggregates  more  open  and  mechanically  weaker. 

(3)  Crumb  formation  is  induced  only  by  those  liquids  the  molecules  of  which  have 
an  appreciable  dipole  moment.  Thus  water,  methyl,  ethyl,  and  amyl  alcohols,  and 
nitrobenzene,  all  gave  definite  aggregates,  while  benzene  and  carbonate  tetrachloride 
gave  no  real  aggregates  at  all. 

(4)  The  hardness  of  the  crumbs  is  dependent  on  the  sizes  of  the  clay  particles,  of 
the  exchangeable  cations  and  of  the  molecules  of  the  wetting  liquid.  If  any  of  these 
be  large  the  aggregates  formed  are  soft :  only  when  they  are  all  small  are  the  aggregates 
hard.  Thus,  soft  aggregates  are  formed  from  coarse  clay  particles,  however  large 
their  base  exchange  capacity  ;  from  clays  holding  large  complex  cations  such  as 
phenyl-benzyl-dimethyl-ammonium  ;  and  from  clays  of  small  particle  size  and  small 
cations  wetted  with  liquids  like  nitrobenzene  or  amyl  alcohol,  although  these  same  clays 
if  wetted  with  methyl  or  ethyl  alcohol  form  hard  aggregates. 

(5)  The  process  of  crumb  formation  may  be  reversible  or  irreversible  ;  in  the  former 
process  the  crumbs  fall  down  to  a  paste  on  rewetting,  in  the  latter  they  do  not,  but 
remain  stable.  The  stability  depends  on  the  liquid  and  on  the  exchangeable  cations 
of  the  particles.  Sodium  clay  aggregates  are  unstable  in  water,  though  stable  in  dry 
amyl  alcohol.  Calcium  clay  aggregates,  on  the  other  hand,  are  stable  in  water  though 
unstable  in  drv  aiiivl  alcohol. 
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The  formation  of  crumbs,  therefore,  appears  to  depend  on  an  interaction  between 
the  exchangeable  cations  of  the  clay  and  the  dispersion  liquid. 

The  Mechanism  of  Crumb  Formation 
Clay  particles  suspended  in  water  possess  an  electrokinetic  potential  and  contribute, 
as  ScHOFiELD*  has  shown,  to  the  electrical  conductivity  of  the  suspension.  This 
mdicates  that  each  particle  is  surrounded  by  an  electrical  double  layer,  the  outer  one 
being  diffuse  and  consisting  of  cations,  while  the  inner  layer  consists  of  negative  charges 
presumably  anchored  on  the  surface  of  the  particle.  The  cations  in  the  diffuse  layer 
move  about  in  the  water  in  the  same  way  as  they  do  aroimd  a  complex  anion,  as  pictured 
in  the  Debye-Hiickel  theory  of  strong  electrolytes.  Smce  the  water  molecules  possess 
a  dipole  moment,  they  tend  to  be  orientated  along  the  lines  of  electric  force  radiating 
from  each  ion  m  the  diffuse  layer  and  from  each  free  charge  on  the  surface  of  the  clay 
particle.  Every  cation  and  particle  is  thus  surrounded  by  an  envelope  of  orientated 
water  molecules,  and  the  orientation  manifests  itself  as  an  apparent  adsorption  or 
immobilization  of  water  by  the  clay.  Some  of  the  water  molecules  near  an  electric 
charge  may  be  so  strongly  orientated  that  they  apj^ear  bound  to  it,  their  heat  motion 
merely  making  them  oscillate  about  the  lines  of  force,  while  those  further  away  from 
the  charge  possess  only  a  statistical  orientation  around  it.  A  clay  particle  in  a  dilute 
suspension  can,  therefore,  be  pictured  as  consisting  of  a  central  core  surroimded  by 
a  surface  carrying  a  negative  charge.  Around  each  negative  charge  is  an  envelope 
of  water  molecules  which  are  more  strongly  orientated  the  nearer  they  lie  to  the  charge. 
Outside  this  surface  are  the  cations,  also  possessing  envelopes  of  orientated  water 
molecules.  Some  cations  are  so  close  to  a  negative  charge  on  the  surface  of  the  clay 
particle  that  the  two  water  envelopes  belonging  to  these  charges  overlap  and  the  water 
molecules  in  this  region  are  orientated  in  their  joint  field.  This  orientation  is  very 
strong,  since  the  negative  end  of  the  water  dipole  is  attracted  to  the  cation  and  the 
positive  end  to  the  particle's  surface,  as  shown  diagrammatically  in  fig.  I. 
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As  the  water  is  removed  the  deflocculated  clay  suspension  becomes  more  concentrated 
and  an  increasing  proportion  of  the  water  molecules  become  orientated  in  the  joint  field 
of  a  positive  and  a  negative  charge.  It  is  reasonable  to  assume  that  a  certain  number 
of  cations  will  share  their  orientated  envelopes  with  two  clay  particles,  as  is  shown 
diagrammatically  in  fig.  2. 

A  linkmg  system  is  thus  set  up  consistmg  of  : — 
particle — orientated  wetting  molecule — cation — orientated  wetting  molecule — particle.* 

As  removal  of  water  proceeds,  an  increasing  proportion  of  cations  share  their  water 
envelopes  with  two  clay  particles,  and  so  the  nmnber  of  Imks  increases.  The  links  also 
become  stronger,  because  they  become  shorter  and  the  mean  orientatmg  couple  on  the 
water  molecules  increases.  In  consequence,  the  cohesion  of  the  clay  particles,  i.e.,  the 
hardness  of  the  crumbs,  increases. 

The  hypothesis  involves  two  assumptions,  which  have  not  yet  been  directly  tested, 
though  there  is  some  indirect  evidence  in  their  favour  : — 

(1)  that  a  system  of  the  type  postulated  is  in  fact  mechanically  stable  ; 

(2)  that  the  number  of  cations  dissociating  from  a  clay  jiarticle  in  a  system  that  is 

losing  water  decreases  less  rapidly  than  the  number  of  water  molecules. 

The  hypothesis  accounts  for  the  formation  of  crumbs  by  other  polar  liquids  as  well  as 
water,  and  for  their  non-formation  by  non-polar  liquids.  It  also  explains  the  necessity 
for  the  clays  to  possess  dissociating  cations,  i.e.,  base  exchange  capacity. 

Further,  it  explains  the  observed  relation  between  the  hardness  of  the  aggregates  and 
the  size  of  the  clay  particles,  the  cations  and  the  molecules  of  the  wetting  liquid.  For  the 
smaller  the  clay  particles,  the  larger  the  number  of  links  2)er  miit  volume  between  them, 
jsrovided  the  surface  of  the  clay  particle  be  curved  and  not  plane.  Again,  a  large 
univalent  cation,  such  as  the  phenyl-benzyl-dimethyl-ammonium  ion,  has  a  much  smaller 
surface  charge  density  than  a  potassium  ion,  and  so  forms  weaker  dipole-cation-dipole  links 
and  therefore  weaker  aggregates  than  those  of  potassium  clay.  Finally,  the  strength 
of  the  bond  between  cation  and  clay  particle  increases  with  the  dipole  moment  of  the 
liquid,  provided  the  position  of  the  dipole  m  the  molecule  remains  about  constant. 
But  if  one  end  of  the  dipole  becomes  large,  as  for  exanq)le  in  amyl  alcohol  and  nitro- 
benzene where  the  positive  ends  are  the  groups  C5H11  or  C^Hj  while  the  negative  end  is 
the  simple  group  -OH  or  -NO2,  then  the  positive  end  of  these  dipoles  camiot  be  strongly 
attracted  to  the  negative  charges  on  the  clay  surface,  and  so  cannot  form  a  strong 
binding  linlc.  This  is  in  agreement  with  the  experimental  result  that  hard  crumbs  can 
be  formed  from  water  and  ethyl  alcohol,  when  both  ends  of  the  dipole  are  small,  but 
not  from  amyl  alcohol  or  nitrobenzene. 

Crumb  formation  is  thus  seen  to  occur  when  each  end  of  the  dipoles  of  the  molecules 
of  the  wetting  liquid  is  reactive,  so  that  one  end  interacts  with  the  negative  charge  on 
the  clay  particle  and  the  other  with  the  positive  charge  on  the  dissociated  cation.     If, 

*  For  couvenienco  this  is  called  in  later  paragraphs  the  dipole-cation-dipole  link. 
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however,  only  one  end  of  the  dipole  is  strongly  reactive,  then  hard  cruiiibs  are  iiot 
formed  though  considerable  interaction  with  the  clay  may  still  take  place  as  shown  by 
the  reduction  in  specific  volume.  For  example,  clays  interact  as  strongly  with  amyl 
alcohol  as  with  ethyl  alcohol,  and  with  considerable  dimmution  in  specific  volume  in 
each  alcohol,  yet  they  form  hard  crumbs  from  ethyl  and  not  from  amyl  alcohol. 

The  injlucnce  of  Electrolytes  on  Particle  Cementation  :    Relation  of  Crumb  Formation 

with  Flocculation 

The  phenomena  of  crumb  formation  on  drying  after  wetting  with  water  are  modified 
in  the  presence  of  electrolytes. 
Two  efiects  are  produced  : — 

( 1 )  the  crumbs  are  smaller  and  weaker  ; 

(2)  provided  the  concentration  of  the  electrolyte  exceeds  a  certain  value  all  crumbs 

are  stable  in  the  solution  of  the  electrolyte. 

Both  may  be  attributed  to  the  reduction  in  dissociation  of  the  cations  from  the 
clay  surface  caused  by  the  electrolyte. 

This  reduces  the  number  of  dipole-cation-dipole  links  producible  on  drying  and  there- 
fore weakens  the  resulting  crumbs. 

While  crumb  instability  has  not  yet  been  studied  iix  detail  there  is  reason  to  connect 
it  with  the  dissociation  from  the  clay  particles  of  a  large  number  of  cations  having 
voluminous  hydration  or  solvation  envelopes.  The  presence  of  the  electrolyte,  by 
decreasing  both  dissociation  and  hydration  of  the  cations,  therefore  increases  the 
stability  of  the  crumbs. 

Analogies  have  often  been  traced  between  flocculation  {i.e.,  floe  formation)  in  a  clay 
suspension  and  crumb  formation  in  a  drying  soil ;  actually  these  analogies  are  not  valid, 
the  jjrocesses  being  difl'erent.  Floe  formation  has  recently  been  discussed  by  the 
author*  for  aqueous  solution  and  by  McDowell  and  UsherI  for  organic  liquids.  It 
occurs  only  when  the  electrokinetic  potential  of  the  clay  particle  is  low,  i.e.,  when  only 
small  dissociation  of  the  cations  occurs.  Crumb  formation,  on  the  other  hand,  occurs 
when  there  is  an  appreciable  amount  of  dissociation  of  cations  from  the  surface,  i.e.,  when 
the  electrokinetic  potential  is  high. 

Floes  have  a  very  opeii  heterogeneous  structure  and  a  large  volume  in  comparison 
with  that  of  their  component  particles  :  they  are  weakly  aggregated  and  can  be  broken 
up  by  shakmg.  Crumbs,  on  the  other  hand,  have  a  much  closer,  more  homogeneous 
structure,  and  can  be  much  more  strongly  held  together. 

No  satisfactory  hypothesis  has  been  put  forward  to  account  for  the  cohesion  of  clay 
particles  into  floes  and  therefore  no  detailed  comparison  with  the  mechanism  of  crumb 
formation  can  be  made. 

*  '  J.  Agric.  Sci.;  vol.  22,  p.  165  (1932). 

t  '  Proc.  Roy.  Soc.,'  A,  vol.  131,  pp.  409,  564  (1931). 
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The  essential  difference  between  the  two  processes  is  brought  out  by  comparing  the 

effect  of  adding  calcium  chloride  to  a  5%  suspension  of  calcium-clay  in  water,  with  the 

effect  of  adding  sodium  chloride  to  a  5%  suspension  of  sodium-clay.     The  calcium-clay 

forms  floes  which  settle  to  a  fairly  loose  sediment,  but  the  sodium-clay  forms  no  floes 

while  the  whole  suspension  sets  and  becomes  rigid  and  thixotropic.     The  calcium 

chloride  strongly  depresses  the  dissociation  of  calcium  ions  from  the  clay  surface,  so 

that  floe  formation  can  take  place,  while  sodium  chloride  cannot  depress  the  dissociation 

of  sodium  ions  from  the  clay  to  anything  like  the  same  extent,  so  that  floe  formation 

cannot  take  place.     The  sodium  and  chloride  ions  introduced  by  the  added  salt,  as  well 

as  the  sodium  ions  that  have  dissociated  from  the  clay,  have  envelopes  of  orientated 

water  molecules  around  them,  and  so  can  form  links  of  the  type 

+  _-  +         -+         -         +~  +         -        + 

(dipole)  (cation)  (dipole)  (anion)  (dipole)  (cation)  (dipole) 

which  can  bind  two  clay  particles  together.  This  link  is  of  the  same  type  as  the  simple 
dipole-cation-dipole  link  illustrated  in  fig.  2,  but  it  is  longer,  it  can  hold  much  more 
water  and  therefore  becomes  effective  at  much  higher  water  concentrations.  The  length 
is  responsible  for  the  thixotropic  behaviour  of  these  rigid  suspensions,  since  if  the  system 
is  violently  disturbed  the  binding  links  are  broken  down  and  take  a  certain  time  to 
build  up  again. 

Exferimental 

Materials — Clays  separated  from  the  following  soils  were  used  in  this  investigation. 

Alluvial  or  drift  soils — Clay-with-flints  (Rothamsted),  Boulder  clay  (Samuelstown, 
Haddington),  three  London  Clays  (Whitstable,  Heme  Connnon  and  Chessington,  near 
Surbiton),  Carse  soil  (Larbert),  Fen  soil  (Cambridgeshire),  Gezira  soil  (Wad  Medani, 
Sudan)  and  a  Palestine  soil  (Mikveh-Israel). 

Soils  from  solid  formations — Old  Red  Sandstone  (Putley,  near  Ledbury),  Triassic 
Keuper  Marl  (Long  Ashton),  Lower  Lias  (Axminster),  two  lower  Oolitic  soils  (Andover- 
ford,  Gloucester,  and  Redbourne  Kirton-in-Lindsey)  and  an  Oxford  clay  (Bampton 
Lew). 

To  prepare  the  separated  clays,  the  soils  were  usually  first  treated  wnth  sodium 
hyjDobromite  to  oxidise  and  to  bring  into  solution  much  of  the  organic  matter,  and 
to  assist  in  the  breaking  up  of  the  soil  crumbs.  The  soil  was  then  rej^eatedly  dispersed 
in  a  large  volume  of  dilute  (0-05%)  sodium  carbonate.  The  material  that  remained 
in  suspension  in  the  top  20  cm  after  24  hours'  standing  was  then  syphoned  off  and 
flocculated  with  dilute  acid.  The  bases  were  then  washed  out,  with  dilute  HCl  (less 
than  N/50)  in  the  first  series  of  separations,  and  with  half-neutralized  sodium  acetate 
ipti  about  4  •  5)  in  the  later  series,  until  the  sujiernatant  liquid  was  calcium-free  ;  the 
acid  or  sodium  acetate  was  then  washed  out  with  distilled  water  until  the  clay  was 
deflocculated.  In  the  first  series  of  experiments  the  clay  was  divided  into  aliquots, 
and  each  was  saturated  with  a  given  base  by  adding  the  corresponding  acetate.     The 
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acetates  were  added  rejjeatedly  until  the  p^  of  the  sujiernatant  liquid  was  about  7, 
when  the  alkaline-earth  acetates  were  washed  out  with  distilled  water,  and  the  alkali 
acetates  with  distilled  water  followed  by  alcohol.  In  the  second  series  of  experiments 
the  purified  hydrogen  or  sodium  clay  was  passed  through  a  Sharpies  supercentrifuge 
to  remove  the  coarse  clay  and  fine  silt  particles  present,  so  that  the  deflocculated  clay 
suspension  would  not  settle  appreciably  during  preparation,  thus  ensurmg  that  the 
crumbs  produced  should  have  as  uniform  composition  as  possible.  Instead  of  preparing 
the  different  homionic  clays — that  is,  clays  containing  only  one  exchangeable  base — by 
adding  the  acetate,  either  of  two  buffer  solutions  were  used  :  half-neutralized  p.rdtTo- 
phenol — 2^n  about  7,  and  half-neutralized  acetate — pn  about  4-5,  in  each  of  which  the 
base  concentration  was  N/50.* 

The  liquids  used  for  wetting  were,  whenever  possible,  of  A.R.  standard  of  purity, 
and  were  always  dried  either  over  phosphorus  pentoxide  or  over  freshly  ignited  hme 
for  3-4  weeks  and  distilled  just  before  use.  No  more  refined  drying  technique  was 
employed,  as  the  liquids  were  in  contact  with  the  air  for  short  periods  and  so  could 
absorb  traces  of  water  vapour  just  before  and  just  after  wetting  the  crumbs. 

The  Experiments  on  Crumb  Formation  and  Crumb  Stability 

(a)  Relation  of  partide  size  to  hardness  of  aggregates — Silt  fractions  were  separated 
from  the  Carse,  Lower  Lias  and  Gloucester  soils,  and  coarse  clay  fractions  from  the 
Gloucester  and  one  of  the  London  clay  soils.  The  coarser  materials  were  easily 
separated  from  the  silt  and  clay  by  sedimentation,  the  silt  used  having  a  settling 
velocity  between  20  and  3-5  (j./sec.  The  fine  clay  crumbs  were  dispersed  by  repeated 
alternate  treatment  with  sodium  hypobromite  and  boiling  with  hydrogen  peroxide, 
and  the  clay  was  removed  by  repeated  dispersion  in  sodium  carbonate.  These  treat- 
ments were  continued  until  no  appreciable  amoimt  of  clay  came  into  suspension.  The 
coarse  clay  fractions  were  prepared  from  the  complete  clay  fraction  by  passing  the 
deflocculated  clay  through  a  Sharpies  sujjercentrifuge,  removing  the  coarse  fraction 
from  the  bottom  of  the  bowl  and  repeatedly  passing  this  fraction  through  the  centrifuge 
to  remove  the  finer  clay  particles  from  it.  The  settling  velocities  of  these  clay  particles 
camiot  be  calculated  from  their  method  of  preparation,  nor  has  an  ultra-microscopic 
count  been  made  on  the  dispersion,  so  that  it  is  only  possible  to  give  the  upper  limit 
of  their  settling  velocities,  namely,  3  •  5  (x/sec.  The  base  holding  power  of  these  materials 
between  ScHOFiELD'sf  hydrochloric  acid  level  and  his  ^j-nitrophenol  level  are  in  milli- 
equivalents/100  grams  : — 

For  the  silts  :   Carse  2-2,  Lower  Lias  3-5,  Gloucester  10-8  ; 

For  the  coarse  clays  :    Gloucester  33-5,  London  I  23-2. 

*  The  author  is  indebted  to  Dr.  R.  K.  Schofield  for  bringing  to  his  notice  the  advantage  of  using 
strongly  bufiered  solutions  for  the  preparation  of  homionic  clays,  and  for  suggesting  the  most  appropriate 
weak  acids  to  use. 

t  '  J.  Agric.  Sci.,'  vol.  23,  p.  252  (1933). 
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On  drying,  or  on  freezuig  and  drying  from  a  water  dispersion,  these  materials  formed 
weak  blocks.  The  sUts  broke  down  immediately  to  a  fine  powder  when  touched  with 
the  finger,  while  the  coarse  clays  had  to  be  rubbed  between  the  fingers. 

(6)  Relation  of  size  of  the  exchangeable  ion  to  hardness  of  crutnb. — Two  separate 
fractions  of  a  Gloucester  clay,  a  medium  fine  and  a  very  fine  fraction,  were  divided  into 
five  aliquots,  which  were  saturated  up  to  joh  7  with  Ca,  Na,  K,  Mg  and  phenyl-benzyl- 
dimethyl-ammonium.  The  phenyl-benzyl-dimethyl-ammonium  chloride  was  prepared 
by  adding  equimolecular  quantities  of  dimethyl  aniline  and  benzyl  chloride  in  absolute 
alcohol,  leaving  in  the  dark  for  one  month  and  precipitating  the  salt  with  ether.  The 
crystals  were  then  washed  with  ether,  redissolved  in  alcohol,  reprecipitated  and 
rewashed  with  ether  and  dried.  The  acetate  or  chloride  was  then  washed  out,  and 
the  defiocculated  clay  paste  either  frozen  or,  if  it  was  the  sodium  or  potassium  clay, 
dried.  For  each  of  the  calcium  and  magnesium  clays  hard  crumbs,  diameter  about 
3  nun,  were  formed  when  dry,  but  for  the  substituted  ammonium  clay  the  crumbs 
Avere  smaller  and  crumbled  between  the  fingers  to  a  fine  powder  ;  this  clay  held  nearly 
60  milli-equivalents  of  exchangeable  base  per  100  grams,  and  fully  deflocculates  only 
with  difficulty.     When  dry  it  has  a  very  faint  smell  of  dimethyl  aniline. 

(c)  Relation  of  uvtting  liquid  to  hardness  of  crumbs — Clay  dispersions  made  up  in 
different  liquids  were  obtained  from  the  defiocculated  clay  in  water  by  removing  the 
water  with  alcohol,  the  alcohol  with  the  required  liquid  or  with  benzene,  or  the  benzene 
with  the  required  liquid.  At  each  renewal  of  the  liquid  the  clay  was  shaken  for  several 
hours  to  ensure  good  dispersion.  The  liquid  was  removed  with  Berkefeld  filter  candles 
attached  to  a  water  pump  through  sulphuric  acid  bottles,  only  dry  air  being  admitted 
to  the  bottles  containing  the  clay.  Freshly  distilled  dry  liquids  were  always  used 
finally.  These  clay  dispersions  were  then  dried  at  80°  C  over  P2O5  in  a  desiccator  kept 
at  a  low  pressure  with  an  oil  pump  containing  an  ice  condenser  and  silica-gel  adsorption 
tubes  in  the  vacuum  circuit.  Hydrogen-clays  separated  from  the  Old  Red  Sandstone, 
Lower  Lias  and  London  Clay  soils,  and  a  calcium  London  clay  gave  only  extremely  soft 
aggregates  from  benzene  and  carbon  tetrachloride,  not  quite  so  soft  from  amyl  alcohol, 
and  hard  fron\  ethyl  alcohol  dispersions,  while  definite,  though  fairly  soft,  crumbs  were 
formed  by  freezmg  and  drying  a  nitrobenzene  dispersion  of  the  hydrogen  Lower  Lias  clay. 

{d)  Relation  of  wetting  liquid  to  stahility  of  crumbs — If  a  crumb  formed  by  freezing 
a  water  ])aste  of  a  calcium  clay  is  thoroughly  dried  and  then  wetted  with  dry  amyl 
alcohol  under  vacuum  it  usually,  though  not  always,  broke  down  into  a  paste,  but 
no  sodium  clay  crumb  l)roke  down.  It  was  also  found  that  a  defiocculated  suspension 
of  a  calcium  clay  can  readily  be  formed  in  dry  amyl  alcohol,  whUe  it  has  not  yet  been 
possible  to  deflocculate  a  sodium  clay  in  it. 

Determination  of  the  Specif  c  Volume  of  the  Clays 

To  determine  the  specific  volume  of  the  clays,  about  1 0  grams  of  clay  crumbs  were 
put  in  a  25-cc   specific  gravity  bottle,  placed  over  PgOj  in  a  desiccator,  which  was 
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evacuated  by  the  i)il  puiiip,  and  left  for  a  week.  Fresh  PgO.^  was  then  })ut  m,  the 
desiccator  re-evacuated  and  left  for  at  least  two  weeks,  and  this  stage  would  be  repeated 
unless  the  surface  of  the  P2O-,  was  stUl  powdery.  For  non-volatile  liquids  wetting  was 
carried  out  under  vacuum  with  the  oil  pump  running,  but  for  volatile  liquids,  such 
as  water  or  the  alcohols,  this  method  was  not  satisfactory,  since  the  clay  crumbs  would 
be  shot  out  of  the  bottle  as  the  liquid  came  into  contact  with  them.  For  volatile 
liquids  two  different  methods  of  wetting  were  used.  In  the  first,  after  the  crumbs  had 
been  evacuated  for  about  20  minutes,  the  vacuum  circuit  was  shut  and  the  liquid  was 
run  into  the  bottom  of  the  desiccator,  and  after  this  had  been  left  for  about  10  minutes 
fresh  liquid  was  then  run  into  the  bottles  containing  the  crumbs.  This  method  was  still 
not  very  satisfactory  as  some  bumping  often  occurred  when  the  liquid  first  came  in 
contact  with  the  clay.  In  the  second  method  tried,  the  desiccator  was  evacuated,  then 
a  stream  of  the  vapour  was  passed  over  the  crumbs  to  enable  them  to  adsorb  an 
appreciable  amount  of  the  liquid,  then  the  liquid  itself  was  run  into  the  bottles.  This 
method,  which  was  used  only  at  the  end  of  the  experiments,  was  much  more  satis- 
factory. After  the  clay  had  been  wetted  under  vacuum,  the  bottles  were  put  into 
closed  tanks  containing  dry  paraffin  for  3-6  hours,  these  tanks  being  immersed  in  a 
large  water  thermostat  tank  rmining  at  about  25°  C.  Three  hours  was  found  to  be 
sufficiently  long  to  allow  the  clay  to  reach  equilibrium  with  the  liquid. 

In  the  calculation  of  the  apparent  specific  volume  of  the  clay  crumbs  the  density 
of  water  was  always  assumed  to  be  that  given  in  Landolt-Bornstein's  tables,  all 
other  densities  being  calculated  from  this.  Buoyancy  corrections  were  applied  to  the 
weights  of  the  clay  and  of  the  liquids,  the  air  density  being  determined  each  time  from 
the  barometer,  thermometer  and  wet  and  dry  bulb  readings  with  the  use  of  Landolt- 
Bornstein's  tables.  The  differential  buoyancy  correction  needed  for  the  layer  of 
adsorbed  liquid  was  not  large  enough  to  be  significant.  The  error  introduced  iix  the 
determination  of  the  specific  volume  of  the  clays  under  the  ex])erimental  conditions, 
due  to  limitations  in  the  accuracy  of  the  instruments  used,  is  about  10"*,  this  being  caused 
by  errors  of  0-1  mg  in  the  weighings  and  0-05°  C  in  reading  the  temjjerature  in  the 
thermostat,  these  two  sources  of  error  being  about  equal.  The  only  large  sources  of 
error  are  loss  of  clay  from  the  bottles  during  wetting  under  vacuum,  entrapping  of  air, 
and  loss  of  liquid  by  evaporation  between  taking  the  bottle  out  of  the  thermostat  and 
weighing  it.  Each  of  these  three  sources  of  error  will  make  the  specific  volume  of  the 
clay  appear  too  large,  and  as  a  matter  of  fact  no  case  has  yet  arisen  where  the  specific 
volume  appeared  to  be  obviously  too  low.  Duplicate  determinations  w'ere  not  made 
together,  and  everythmg  possible  was  done  to  make  the  maximum  amount  of  systematic 
error  occur  as  random  error  between  duplicates.  From  a  study  of  the  actual  deviations 
between  400  duplicate  determuaations  of  the  specific  volume  of  the  clays  available, 
only  43%  had  divergences  less  than  2-5  X  10"*,  in.  25%  it  was  between  2-5-5-0  X  10"*, 
in  12%  between  5-10  X  10"*,  and  in  20%  it  exceeded  lO'^.  Thus  in  20%  of  the  dupli- 
cate determinations  at  least  one  of  the  determinations  had  an  error  of  10  ^,  so  that 
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at  least  10%  of  the  individual  determinations  had  to  be  rejected  owing  to  one  of  the 
three  major  sources  of  error,  of  which  loss  of  clay  during  wetting  under  vacuum  was 
the  most  common.  It  thus  appears  probable  that  the  error  of  the  mean  is  usually 
about  2-5X10*.  It  is  possible  to  make  an  mdependent  estimate  of  this  error,  for  some 
of  the  determmations  were  repeated  at  different  times,  and  by  comparing  the  differences 
between  their  means,  approximately  the  same  figure  is  obtained.  In  conclusion, 
therefore,  the  error  in  a  difierence  between  means  will  be  about  5  X  10"*. 


Specific  Volume  of  Clays  in  Different  Liquids 

Few  of  the  methods  devised  in  general  colloid  science  to  measure  the  mteraction 
of  a  liquid  or  gas  with  a  solid  substrate  have  been,  or  can  be,  applied  to  clays,  and 
so  far  no  systematic  investigation  of  this  interaction  in  clays  has  been  undertaken. 
The  method  adopted  by  the  author  has  been  to  measure  the  apparent  density  of  carefully 
dried  soil  and  clay  crumbs  in  different  liquids.  This  method,  which  was  first  used  by 
Williams*  and  Cude  and  Hulett|  for  charcoal,  seems  to  have  been  used  for  soils 
only  by  Anderson  and  Mattson,J  who  found  that  dried  separated  clays  appeared  to 
be  heavier  in  water  than  in  toluene,  and  by  Tschapek§  who  found  that  soils  appeared 
heavier  in  water  than  in  the  four  hydrocarbons  benzene,  xylene,  petrol,  and  jsetroleum 
(?  parafiin),  in  which  the  density  was  about  the  same.  The  method  has  the  disadvantage 
that  the  results  are  not  easy  to  interpret,  since  four  separate  factors  can  influence  them  : — 

(1)  the  interaction  of  the  particle  surface  with  the  dispersion  medium  ; 

(2)  the  proportion  of  the  total  surface  area  of  the  clay  particles  forming  the  crumbs 

accessible  to  the  wetting  liquid  ; 

(3)  the  compression  of  the  liquid  in  the  fine  capillaries  of  the  crumbs  due  to  the 

large  curvature  of  the  solid-liquid  mterfaces  ; 

(4)  tlie  amount  of  adsorbed  and  entrapped  gases  in  the  crumbs  after  wetting. 

The  adsorbed  and  entrapped  gases  are  usually  removed  by  a  strong  outgassing  by  keeping 
the  system  at  a  high  temperature  and  a  very  high  vacuum  for  a  considerable  time,  but 
this  method  is  probably  inapplicable  to  clays  as  certain  irreversible  processes  seem 
to  attend  high  temperature  dehydration,  100-200°  C,  though  this  point  has  never  been 
adequately  examined.  For  this  reason,  and  also  because  the  work  was  largely  explora- 
tive, no  elaborate  degassing  technique  was  employed,  wetting  being  carried  out  slowly 
in  a  desiccator  connected  to  a  small  rotary  Hyvac  oil  pump. 

The  experiments  were  designed  to  show  the  influence  of  the  exchangeable  ions,  held 
by  a  clay,  on  the  apparent  specific  volume  of  the  clay  when  immersed  in  different 

*  '  Proc.  Roy.  Soc.,'  A.,  vol.  98,  p.  223  (1920). 
t  'J.  Amer.  Chem.  Soc.,'  vol.  42,  391   (1920). 
X  'U.S.  Dept.  of  Agric.  Bull.,'  1452  (1926). 
§  '  KoUoidzschr.,'  vol.  63,  p.  34  (1933). 
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liquids.  Separated  clays  were  used,  as  very  uniform  crumbs  can  be  obtained  from  them, 
so  that  sampling  errors  are  negligi})le  compared  with  other  sources  of  error.  This 
condition  can  barely  be  obtained  with  soils,  for  during  the  preparation  of  a  soil  holding 
one  metallic  cation  only,  the  coarse  and  fine  fractions  become  partially  separated,  through 
the  crumbs  breaking  down  in  the  solutions  used,  and  it  is  very  difficult  thoroughly 
to  remix  all  the  materials.  Smce  it  was  also  considered  more  important  to  use  only 
a  few  liquids  but  as  many  clays  as  possible,  only  two  liquids,  water  and  tetralin  (tetra- 
hydronaphthalene),  were  used  for  every  clay,  though  amyl  alcohol,  aniline  and  nitro- 
benzene were  also  used  for  most  of  them.  Isolated  experiments  in  a  number  of  other 
liquids  were  also  made.  Consideriag  these  five  liquids  only,  the  specific  volume  of  a 
clay  was  greatest  in  tetralin  and  next  in  nitrobenzene  without  exception.  With  the 
exception  of  some  of  the  hydrogen  clays,  the  specific  volume  in  aniline  was  greater  than 
in  water,  and  greater  in  amyl  alcohol  than  in  water,  with  the  exception  of  the  magnesium 
and  one  of  the  calcium  clays,  and  about  the  same  in  amyl  alcohol  and  aniline  for  the 
sodium,  potassium  and  most  of  the  calcium  clays.  Since  the  specific  volume  of  a 
clay  is  greatest  in  tetralin,  these  data  will  first  be  examined  to  see  how  far  they  support 
the  hypothesis  that  tetralin  has  no  interaction  with  the  exchangeable  ions  on  the  clay. 
The  specijic  volumes  of  days  in  tetralin — The  specific  volume  of  a  clay  in  a  liquid 
depends  on  the  followuxg  factors  : — 

(a)  the  specific  volume  of  the  clay  substrate  ; 

(6)  the  weights  and  volumes  of  the  exchangeable  ions  on  the  clay  ; 

(c)  the  interaction  between  the  exchangeable  ions  on  the  clay  particles  and  the 
wettmg  liquid  ; 

{d)  the  interaction  between  the  clay  substrate  and  the  wetting  liquid  ; 

(e)  for  dried  clay  crumbs  the  sizes  of  the  jDores  in  the  clay  crumb,  these  determining 
both  the  accessibility  of  the  liquid  molecules  to  the  clay  surface,  and  the  com- 
pression of  the  liquid  in  the  pores  due  to  interfacial  tension  forces. 

By  considering  the  apparent  specific  volumes  of  different  ionic  clays  m  a  given  liquid, 
it  should  be  possible  to  decide  if  the  variations  between  the  specific  volumes  of  a  clay 
holding  different  exchangeable  ions  can  be  accounted  for  completely  by  the  weights 
and  volumes  of  the  exchangeable  ions ;  if  so,  the  exchangeable  ions  would  have  no 
differential  effect  on  the  total  volume  of  the  fine  pores  in  the  crumbs,  nor  any  differential 
interaction  with  the  wetting  liquid. 

The  contribution  of  the  exchangeable  ions  to  the  specific  volume  of  the  clay  can 
be  calculated  as  follows.  Let  <7^  and  CTj,  be  the  specific  volumes  of  a  clay  holding  B 
equivalents  of  ion  X  and  ion  Y  respectively  in  a  given  wetting  liquid,  and  let  V,  V^, 
and  Yy  be  the  volumes  of  one  equivalent  of  clay,  ion  X  and  ion  Y  respectively  and  Mj^ 
and  My  the  equivalent  weights  of  the  two  ions,  then 


_  V  -f  BV,         ,       _  V  +  BV„ 
~  1  +BM.  '  ""'"'^  ""  ~  1+BM, 
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or,  on  eliminating  V 

1  +  BM/.       -r/V^-V, 


T) 

1  +  BM,'  \1  +  BM,, 

The  two  denominators  of  this  equation  can  be  expanded,  and  all  except  the  first  two 
terms  of  each  expansion  ignored  since  BM„  does  not  exceed  0  •  03  in  any  of  the  clays 
considered,  giving  as  an  approximate  equation 

a„  -  a,  =  Ba,{M,  -  M,)  -  B  (V,  -  V,) (1) 

Unfortmiately,  although  M^  and  M^,  the  equivalent  weights,  are  known,  V^  and  Vj,, 
their  equivalent  volumes,  are  not,  so  that  this  equation  cannot  be  used  to  calculate 
a,j  —  <Tj.  Using  the  experimentally  determined  values  of  Gj.  and  a„  for  a  series  of 
clays,  it  is,  however,  possible  to  examine  whether  V^.  —  V^,  calculated  from  this  equa- 
tion, is  independent  of  the  type  of  clay  used,  and  so  dependent  solely  on  the  ions  X 
and  Y,  and  whether  the  observed  values  of  V^.  —  V,,  appear  reasonable  on  other  grounds. 
The  results  given  in  table  I,  show  that  V^  —  Ny  is  nearly  independent  of  the  clay  type, 
though,  owing  to  the  large  experimental  errors  attached  to  this  difierence,  it  is  not 
possible  to  say  if  the  values  for  individual  clays  really  differ  significantly.  Table  I 
also  shows  that,  per  equivalent,  the  ionic  volume  of  the  ions  increase  in  the  order  sodium, 
hydrogen,  calcium  and  potassium,  magnesium. 

It  is  possible  to  obtain  further  evidence  for  the  validity  of  these  results  from  an 
analysis  of  the  differences  in  molecular  volume  between  different  crystals  belongiiig 
to  the  same  system.  For  example,  the  densities  of  sodium  chloride  and  potassium 
chloride  crystals  are  given  in  the  International  Critical  Tables  as  2-163  and  1-988, 
and  both  belong  to  the  cubic  system.  Hence,  one  mole  of  sodium  chloride  occupies 
27-0  ccm  and  of  potassium  chloride  37-5  ccm,  so  that  one  mole  equivalent  of  potassium 
is  10-5  ccm  larger  than  one  equivalent  of  sodium,  on  the  assumption  that  the  whole 
difference  in  volume  between  the  sodium  and  potassium  crystals  is  taken  up  by  the 
larger  size  of  the  potassium  ion,  and  ignoring  all  volume  changes  due  to  differences  of 
packing.  This  calculation  can  be  made  for  a  large  number  of  similar  sodium  and  potas- 
sium crystals,  and  for  the  six  available  monovalent  anions  tabulated  in  the  International 
Critical  Tables  the  value  of  Vk  —  Vn;.  varies  from  12-6  to  10-3,  with  a  mean  of 
11-3  ccm,  though  for  the  divalent  anion  crystals  the  figure  is  rather  lower.  This  is 
very  close  to  the  value  11-1  ccm  determined  from  the  specific  volume  of  the  clays. 
The  difference  between  the  calcium  and  magnesium  ions  is  more  difficult  to  determine 
since  the  crystals  of  the  corresponding  salts  usually  belong  to  different  systems,  though 
it  seems  to  vary  from  about  6 — 11  ccm  with  a  mean  of  about  8  ccm  per  mole,  so  that 
Vfa  —  Vmi4  =  4,  and  from  the  scanty  data  available  V^a  —  Vc„  is  about  3  ccm. 
Since  the  results  from  the  density  calculation  are  Vm„  —  V,,,  =  14-6  and  V,.a  —  Vn^  = 
10-0,  these  two  sets  of  results  are  neither  in  quantitative  nor  qualitative  agreement. 
Thus,  while  the  difference  between  the  volumes  of  sodium  and  potassium  ions  on  a 
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clay  is,  on  the  above  assumptions,  about  the  same  as  their  diSerence  in  crystals,  this 
does  not  hold  for  the  sodium-calcium  or  magnesium-calcium  difEerences. 

A  possible  explanation  of  this  discrepancy  is  the  neglect  of  the  water  molecules 
attached  to  the  exchangeable  ions  on  the  clay,  for  it  has  so  far  been  implicitly  assumed 
in  the  calculations  that  the  exchangeable  ions  on  the  dried  clays  used  were  either  them- 
selves dry  or  all  contained  the  same  number  of  molecules  of  water  of  hydration,  an 
assumption  that  is  almost  certainly  false.  The  effect  of  these  water  molecules  on  the 
specific  volume  of  the  different  ionic  clays  can  easily  be  evaluated.  Let  each  ty|3e  of 
ion  X  have  X^  molecules  of  water  attached  to  each  equivalent  of  X,  and  let  Vw  be  the 
mean  volume  occujiied  by  1  mole  of  water  of  hydration,  and  further  assume  that  Vw 
is  independent  of  the  exchangeable  ion,  then  usmg  the  same  notation  as  before,  the 
specific  volumes  of  a  clay  holdmg  B  equivalents  of  ions  X  and  Y  respectively  are 


CT.   = 


V  +  B(V.+  X..V,v)      ^^^^^     ^  _V  +  B(V.,+  x,V,,) 
1  +  B(M,+  18X,)       '  ■"       1  +  B(M, +  18X,) 


giving  as  the  equivalent  of  equation  (1) 

{a,  -  a.)/B  +  a,  (M,  -  M,)  =  (V,  -  V,)  +  (X,  -  X,)  (V,v  -  18a,).    ...  (2) 

Now  the  quantity  on  the  left-hand  side  of  this  equation  has  been  tabulated  in  Table  I 
mider  the  heading  (V,^  —  V;,).  If  now  this  quantity  is  written  as  AV  (obs)  and  the 
true  value  of  (Vj^  —  V,)  is  written  as  AV  (true),  then 


AV  (obs)  —  AV  (true) 

Vw     +     18(7, 


X,,_X,+  ^^  ^oos;  -  gj  ^uue; ^3^ 


If  it  is  assumed  that  A  V  (true)  is  the  value  of  (V„  —  V,)  calculated  from  crystal  density 
data,  and  if  Vw,  the  volume  of  1  mole  of  water  of  hydration,  is  assumed  to  have  the  same 
value  as  1  mole  of  water  of  crystallization  has  m  a  crystal,  then  X^  —  X„  the  difference 
in  hydration  between  one  equivalent  of  Y  and  of  X  can  be  calculated  from  this  equation. 
Unfortimately  Vw  is  not  easy  to  determine,  as  the  addition  of  water  of  crystallization 
to  a  crystal  usually  alters  the  system  to  which  it  belongs.  Ignormg  this  alteration  of 
system,  it  seems  to  vary  from  13-18  in  most  substances  examined,  though  usually  it  lay 
between  14-16  ccm  with  an  average  of  about  15-5  ccm.  The  values  of  X^.^ —  Xj,^ 
and  of  Xjij,  —  X,  ,^  have  been  calculated  for  all  the  clays  available  by  using  the  crystal 
values  —  3-3  ccm  and  —  4-0  ccm  for  A  V  (true)  and  by  using  Vw  =  15-5  ccm,  and 
are  given  in  Table  I.  As  is  seen  from  this  table,  the  mean  values  of  X^.^  —  X^a  and  of 
XjiK  —  ^Cii  are  1-52  and  2-11  respectively.  Thus,  one  equivalent  of  calcium  has,  on  the 
average,  1  •  5  more  moles  of  water  than  one  equivalent  of  sodium,  so  that  3  more  molecules 
of  water  of  hydration  are  attached  to  each  calcium  ion  than  to  each  sodium  ion. 
Similarly,  each  magnesium  ion  has  about  4  more  molecules  of  water  of  hydration 
than  each  calcium  ion.     As  a  consequence  of  the  assumption  that  the  specific  volume 
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of  a  clay  measured  in  tetralin  is  its  true  specific  volume,  then  on  a  carefully  dried  clay 
the  sodium  and  potassium  ions  have  the  same  number  of  molecules  of  water  of  hydra- 
tion, the  calcium  ion  has  about  3  more,  and  the  magnesium  ion  3  +  4  =  7  more 
molecules. 

An  attempt  was  made  to  obtain  inde])endent  evidence  for  the  correctness  of  this  con- 
clusion by  igniting  the  dry  Trias  and  London  I  clays  in  an  electric  furnace  at  650°  C  for 
2  hours,  in  the  hope  that  at  this  temperature  the  ions  would  have  lost  all  their  water  of 
hydration,  though  unfortunately,  owing  to  lack  of  time,  the  clay  samples  could  only  be 
dried  at  110°  C  for  20  hours  instead  of  being  much  more  intensively  dried  over  fresh 
P2O5  for  several  weeks  ui  a  vacuum  desiccator.  The  results  for  the  two  clays  were 
consistent,  and  showed  that  the  magnesium  ion  has  3-5  more  molecules  of  water  of 
hydration  than  the  potassium  ion,  the  calcium  ion  2-2  more,  and  the  sodium  only  0-5 
more.  Quantitative  agreement  between  the  results  of  this  method  and  those  derived 
from  the  specific  volume  of  the  clays  in  tetralm  would  not  be  expected,  but  both  methods 
show  that  the  exchangeable  ions  on  an  intensively  dried  clay  can  hold  water  of 
hydration. 

The  specific  volume  of  clays  in  other  non-polar  liquids — Only  a  few  experiments  have 
been  made  on  the  apjiarent  specific  volume  of  clays  in  other  non-polar  liquids  besides 
tetralin,  but  the  results  indicate  a  sharp  difference  in  behaviour  between  soil  and  clay 
crumbs.  Tschapek  {he.  cit.)  using  two  soils  and  four  non-polar  liquids,  and  the  author 
using  only  one  soil  (Rothamsted)  and  six  liquids  non-polar  or  of  very  slight  polarity 
(benzene,  toluene,  xylene,  parafiin,  tetralin  and  decalin)  found  that  the  specific  volume 
of  the  soil  was  the  same  in  all  the  hydrocarbons  used  within  the  limits  of  experimental 
error.  For  the  few  clays  examined,  which  include  a  Rothamsted  separated  clay,  the 
apparent  specific  volumes  of  dried  clay  crumbs  are  different  in  different  hydrocarbons, 
being  largest  m  decalin  and  carbon  tetrachloride,  next  in  tetralin  and  paraffin,  then  in 
xylene,  and  smallest  in  benzene.  These  differences,  though  definite,  are  not  large,  as  all 
these  specific  volumes  are  larger  than  the  specific  volume  of  the  dry  crumbs  in  nitro- 
benzene, which  gives  the  largest  specific  volume  of  the  polar  liquids  used.  The  simplest 
explanation  of  this  result  is  that  the  exchangeable  ions  do  not  interact  with  the 
hydrocarbons,  but  that  appreciable  capillary  condensation  of  the  liquid  occurs  m  the 
pores  of  the  dry  clay  crumbs  though  not  in  the  pores  of  the  dry  soil  crumbs.  This 
explanation  needs  the  assumption  that  the  pores  in  a  soil  crumb  are  larger  than  in  a 
clay  crumb,  an  assumption  that  is  almost  certainly  true,  although  not  enough  is 
known  about  the  actual  pore  size  in  the  crmnbs  to  judge  if  this  is  sufficient  to  cause 
the  difference. 

Specific  volume  of  clays  in  polar  liquids — This  discussion  will  be  limited  to  the 
differences  between  the  specific  volumes  of  a  clay  in  water,  amyl  alcohol,  aniUne  or 
nitrobenzene  and  m  tetralin.  Let  T  and  Lj  ,  3  or  4  he  the  specific  volumes  of  a  clay  in 
tetralin  and  in  one  of  the  four  liquids  just  mentioned,  then  (T  —  L)  will  give  a  quantity 
from  which  the  density  of  the  clay  substrate  and  equivalent  weights  and  volumes  of  the 
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exchangeable  ions  have  been  eliminated,  and  which  is  therefore  controlled  by  differential 
capillary  condensation  effects  and  by  the  differential  interaction  between  the  clay  and  the 
two  liquids.    These  quantities  (T  —  L)  have  the  following  properties  : — 

(I)  for  a  given  exchangeable  ion  on  the  clay  (T  —  LJ  is  proportional  to  (T  —  La)  for 
different   clays.      Fig.  3  shows  that  for  hydrogen  clays   (T  —  N)    is    apj)roximately 
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Fig.  3. — Specific  volume  reduction  of  hydrogen  clays  in  nitrobenzene  compared  to  water. 

proportional  to  (T  —  W),  where  T,  N,  W  are  the  specific  volumes  of  the  clay  in  tetralin, 
nitrobenzene,  and  water  respectively  ;  and  fig.  4  shows  that  for  both  hydrogen  and 
calcium  clays  (T  —  Am)  is  approximately  proportional  to  (T  —  W)  where  Am  is  the 
specific  volume  of  the  clay  in  amyl  alcohol ; 
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Fig.  4.— Specific  volume  reduction  of  hydrogen  and  calcium  clays  in  amyl  alcohol  compared  to  water. 
O  hydrogen  clays  ;    •  calcium  clays. 
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(2)  for  a  given  liquid  and  a  series  of  clays  (T  —  L)  for  one  exchangeable  ion  is  propor- 
tional to  (T  —  L)  for  a  second.  This  is  illustrated  in  figs.  5  and  6,  in  which  (T  —  W)  for 
the  calcium  clays  is  plotted  against  (T  —  W)  for  the  correspondmg  sodium,  potassium, 
magnesium  and  hydrogen  clays,  and  in  figs.  7,  8,  9,  in  which  (T  —  L)  for  the  calcium 
clay  is  plotted  against  (T  —  L)  for  the  corresponding  hydrogen  clays  where  L  is  nitro- 
benzene, aniline,  or  amyl  alcohol.  These  two  generalizations,  though  only  approximately 
true,  suggest  that  this  quantity  (T  —  L)  is  largely  controlled  by  the  exchangeable  ions  on 
the  clay,  and  not  by  the  clay  substrate  itself. 
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Fig.  5. — Specific  volume  reduction  of  potassium  and  Fig.  6. — Specific  volume  reduction  of  sodium  and  mag- 
hydrogen  clays  in  water  compared  to  calcium  nesium  clays  in  water  compared  to  calcium  clays, 
clays.     •  Potassium  clays  ;    o  hydrogen  clays.  •  Sodium  clays  ;    O  magnesium  clays. 

(3)  (T  —  L)  depends  on  the  base  holding  power  of  the  clay,  since  (T  —  W)  for  the 
hydrogen  clay  is  nearly  proportional  to  its  base  holding  power  at  pn  7,  fig.  10,  and  since 
(T  —  W)  for  those  calcium  clays  which  are  nearly  saturated  up  to  pn  7  with  calcium  is 
nearly  proportional  to  the  calcium  held  by  the  clay,  fig.  11.  Thus  (T  —  L)  is  approxi- 
mately proportional  to  the  exchangeable  ions  held  up  to  ^^n  7.  The  ratio  of  these  two 
quantities,  which  gives  the  mean  reduction  of  specific  volume  of  the  clay  in  water 
from  that  in  tetralin  per  equivalent  of  exchangeable  ion,  will  be  called  the  mean 
interaction  value  of  that  ion. 

To  investigate  what  proportion  of  the  total  hydrogen  ions,  that  can  be  exchanged 
from  a  clay,  take  part  in  this  interaction,  the  relative  constancies  of  the  different 
ratios  (T  —  W)/B  (x)  for  all  the  hydrogen  clays  available  have  been  compared,  where 
B  (a;)  is  the  equivalents  of  calcium  one  gram  of  hydrogen  clay  can  take  up  from  a 
calcium  buffer  solution  of  pn  (x)  and  N/50  m  calcium.  The  follow^ng  variates  were 
used  for  B  (,r)  : — * 

*  For  a  description  of  the  experimental  methods  used  to  measure  the  various  B  (x),  see  Schofield,  R.  K., 
'  J.  Agric.  Sci.,'  vol.  23,  252  (1933)  and  '  Trans.  Vlth  Comm.  Int.  Soc.  Soil  Sci.,'  B,  p.  80  (1933). 
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B  (HCl  —  Ac),  B  (HCl  —  N)  and  B  (HCl  —  P).  These  variates  denote  the  additional 
equivalents  of  calcium  held  per  gram  of  clay  in  a  solution  of  half-neutralized  calcium 
acetate  (jJh  4-5),  half-neutralized  calcium  j9-nitrophenol  {j)a  7),  or  half-neutralized 
calcium  phenol  (jj„  9-8),  respectively,  in  excess  of  the  bases  held  by  the  clay  in  a 
solution  of  0-04  N  HCl  and  0-01  N  metallic  chloride  ; 
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Fig.  8 — Aniline  Fig.  9 — Nitrobenzene 

Figs.  7,  8,  9. — Specific  volume  reduction  of  hydrogen  clays  in  different  liquids  compared  to  calcium  clays. 

B  (Chi  Ac  —  N),  denotes  the  corresponding  additional  quantity  of  calcium  held  in 
the  nitrophenol  solution  in  excess  of  the  calcium  held  in  a  half-neutralized  solution 
of  calcium  chloracetate  (p^  2  •  8)  ; 

B  (N),  B  (P),  denote  the  equivalents  of  calcium  taken  up  per  gram  of  clay  from  the 
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^-nitrophenol  and  the  phenol  buffers  respectively.  Fur  every  hydrogen  clay  used 
these  two  fall  between  the  hydrochloric  acid  and  the  chloracetate  values. 

In  Table  II  are  given  the  individual  values  of  (T  —  W)/B  (x)  for  all  the  hydrogen 
clays  available,  and  the  relative  constancy  of  these  ratios  can  be  compared  by  computing 
the  percentage  deviation  100  S/M  of  the  individual  clays  from  their  mean  M  for  each 
ratio,  where  S  is  the  standard  deviation  of  the  individual  (T  —  'W)/B  (x)  values  for  the 
clays  from  their  mean  M.  The  smallest  percentage  deviation  is  shown  by  using 
B  (CI  Ac  -  N)  for  B  {x),  while  B  (HCl  -  N)  and  B  (N)  are  almost  as  good.  The 
three  quantities  using  the  phenol  buffer  point,  of  which  only  two  are  given  in  the 
table,  give  definitely  larger  deviations,  while  the  acetate  buffer  point  is  obviously 
useless. 

The  variations  of  (T  —  W)/B  (x)  from  its  mean  are  probably  due  to  some  property  of 
the  clay  and  not  solely  to  chance  differences  in  the  amounts  of  residual  bases  left  in 


o 
o 


2000 


1000  2000  JOOO 

(T  -  W)„  (T  -  W)ca 

FiGf .  10. — Specific  volume  reduction  of  hydrogen  clays     Fig  .  11 . — Specific  volume  reduction  of  calcium  clays 
in  water  to  their  base  holding  capacity  at  pn  7.  in  water  to  their  calcium  content. 


the  clay  and  the  experimental  errors  in  the  determination  of  (T  —  W)  and  the  B's, 

for  the  values  of ,  where  y  is  either  HCl,  the  chloracetate  buffer,  or  nothing, 

B(y  —  N) 

1  1  ^  J     -^0,  ^1,  ^•^-     B  (HCl-Ph)    B  (HCl-N)       ,  B  (ClAc-N) 

are  strongly  correlated  with  the  quantities  ^  IhcI-Ac)'  B  (lICl-Ac)  '^"^^  B  (ClAc-Ac)- 

As  an  example,  the  scatter  diaoram  of with  ^  \^^', t-4  >   which   was 

^  ''  B(HCl-N)  B  (HCl -Ac)' 

chosen  because  it  gives  the  best  correlation,  is  given  as  fig.  12.     In  this  figure  five 

points  have  been  ringed,  namely,  two  points  for  the  Lower  Lias  clays,  and  one  for  each 

of  the  Carse,  Kirton-in-Lindsey  (dried)  and  Rothamsted  clays.     For  each  of  these 

clays  there  is  a  duplicate  determination  lying  nearer  the  regression  line.     If  these 
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five  "  bad  "  duplicate  determinations  are  ignored  there  is  only  one  point,  that  for  the 
Old  Red  Sandstone  clay,  that  lies  appreciably  off  the  lines.  None  of  the  other  possible 
(T  —  W)/B  (x)  values  gave  such  good  correlations  with  any  of  the  B  ratios.  Thus 
(T  —  W)/B  (y  —  N),  the  mean  interaction  value  of  each  exchangeable  hydrogen  ion  with 

water,  seems  to  increase  as  the  B  ratios,  for  example,       '      — ^  ,  increase. 

B  (HCl  —  Ac) 

B  (HCl  -  N) 


From  the  physical-chemical  standjjoint 


namely,    the    ratio    of   the 


B  (HCl  -  Ac) 

calcium  which  the  hydrogen  clay  holds  in  a  j9h  7  buffer,  to  the  calcium  which  it  holds 
in  a  j^H  4-5  buffer,  would  be  expected  to  measure  the  strength  of  the  clay  acid,  this 
ratio  increasing  as  the  strength  of  the  acid  decreases.  From  this  pomt  of  view,  the 
mean  effectiveness  of  the  hydrogen  ions  on  the  clay  in  reducing  the  specific  volume  of 
the  clay  in  water  would  increase  as  the  strength  of  the  clay  acid  decreases,  a  conclusion 
not  easy  to  interpret  in  physico-chemical  terms.  A  second  possibility  is  that  as  the 
clay  acid  becomes  weaker  the  interaction  between  the  molecules  of  the  dispersion 


B(HCl-Ac) 
Fig.  12. 


liquid   and   the   negative   charges   on  the   clay   surface   becomes   stronger.     A  third 

possibility  is  that  as  the  clay  acid  becomes  weaker  the  pores  in  the  clay  crumbs  become 

smaller,  so  that  capillary  condensation  becomes  increasingly  more  important.     This 

would  allow  the  actual  reduction  in  the  specific  volume  of  the  clay  in  water  brought 

about  by  the  hydrogen  ions  either  to  be  independent  of,  or  to  decrease  with,  the  strength 

of  the  clay  acid,  so  long  as  the  capillary  condensation  effect  was  more  or  less  proportional 

T)  /trr<i  ATX 

to  -r^  ,Vt/-.i r-^.  for  the  clay.     Unfortunatelv,  so  far  no  measurements  of  the  distri- 

B  (HCl  —  Ac)  -^ 

bution  of  pore  size  in  clay  crumbs  have  been  made. 
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The  effect  of  other  exchangeable  ions  held  by  the  clay  on  the  reduction  of  the  specific 
volume  of  the  clay  in  water  cannot  be  studied  so  simply  as  the  hydrogen  ion  effect, 
as  all  other  ionic  clays  contain  both  exchangeable  bases  and  exchangeable  hydrogen, 
so  that  the  effects  of  the  residual  hydrogen  on  the  specific  volume  is  always  sujier- 
imjjosed  on  the  effects  of  the  other  exchangeable  cations.  This  complication  will 
be  minimized  if  all  the  ionic  clays  used  are  in  equilibrium  with  a  standard  buffer  solution 
of  as  high  a  p^  as  possible.  There  are,  however,  experimental  difficulties  in 
preparing  pure  dry  clays  in  equilibrium  with  buffers  above  p^  7  owing  to  the 
formation  of  insoluble  calcium  and  magnesium  carbonates,  so  that  the  half-neutralized 
jo-nitrophenol  buffer  of  pn  7  was  used  as  standard  for  preparing  the  clays.  Further, 
from  the  preceding  discussion  on  the  hydrogen  clays,  it  appeared  that  this  was  about 
as  high  as  it  is  necessary  to  go  to  eliminate  the  effect  of  the  residual  hydrogen.  Although 
at  one  stage  of  their  preparation,  the  clays  were  in  equilibrium  with  this  buffer  solution, 
a  certain  loss  of  exchangeable  base  occurs  on  removal  of  this  solution,  so  that  the  dried 
clay  crumbs  do  not  actually  contain  sufficient  base  to  be  in  equilibrium  with  it,  but 
this  loss  does  not  usually  exceed  3  milli-quivalents  per  100  grams  of  clay.  For  these 
reasons,  the  iufluence  of  the  exchangeable  bases  on  the  specific  volume  of  the  clay  cannot 
be  accurately  determined.  If,  as  before  (T  —  W)  is  the  reduction  in  specific  volume 
of  the  clay  in  water  from  that  in  tetralin,  one  can,  as  a  first  approximation,  assume 
either  that  it  is  due  to  the  exchangeable  ions  actually  present  and  ignore  the  residual 
hydrogen,  or  that  it  is  due  both  to  the  exchangeable  ions  present  and  to  the  residual 
hydrogen,  with  the  implicit  assumption  that  the  latter  behaves  like  the  base.  On  the 
first  assumption  the  mean  effect  of  one  equivalent  of  base  held  by  the  clay  is  (T— 'W)/B, 
where  B  is  the  amount  of  base  actually  held,  and  on  the  second  is  (T  —  'W)/B  (HCl  —  N). 
Actually  the  difference  between  these  two  quantities  is  small,  and  the  results  are  quali- 
tatively the  same  on  either  assumption,  so  that  only  the  results  based  on  the  second 
assumption  will  be  given. 

Table  III  gives  the  values  of  (T  —  W)/B  (HCl  —  N)  for  all  the  available  non-hydrogen 
clays  which  contained  nearly  sufficient  base  to  be  in  equilibrium  with  a  j^h  7  buffer, 
and  Tables  II  and  III  also  give  the  values  of  (T  —  L)/B  (HCl  —  N)  for  the  hydrogen  and 
calcium  clays  when  L  is  the  specific  volume  of  the  clay  in  amyl  alcohol,  aniline  or 
nitrobenzene.  As  with  the  hydrogen  clay,  (T  —  L)/B  (HCl  —  N)  is  fairly  constant.  The 
individual  variations  from  the  mean  can  be  seen,  on  plotting  out,  to  be  dependent  on 

the  clay,  and  thus  on  ^  —l  .  Naturally,  in  considering  a  plot  of  two  quan- 
tities such  as  (T  —  L)/B  (HCl  —  N)  against  one  another,  experimental  errors  are  very 
serious,  for  not  only  are  the  (T  —  L)'s  and  the  individual  B's  subject  to  appreciable 
errors,  but,  since  the  degree  of  saturation  of  the  clays  with  base  is  variable,  this  also 
produces  a  source  of  error  in  the  clay. 

A  certain  number  of  experiments  were  made  with  clays  holding  less  base  than  was 
necessary  to  bring  them  into  equilibrium  with  a  pt^  7  buffer.     The  clays  were  intended 
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to  be  in  equilibrium  with  half -neutralized  acetate  buffer  of  p^  4-5  containing  N/50  base 
and  N/25  acid,  but  they  were  found  to  have  taken  u^j  too  much  base,  the  extra  propor- 
tion taken  up  varying  from  clay  to  clay.  The  results  of  the  comparisons  are  that  for 
the  London,  Clloucester,  and  Lower  Lias  clays  (T  —  L)  seems  to  be  the  same  for  the 
clay  only  partially  and  almost  completely  saturated  up  to  pn  7,  while  for  the  Kirton-in- 
Lindsey  clay  (T  —  L)  increases  appreciably  between  these  two  levels.  The  specific 
volume  of  the  partially  saturated  clay  in  tetralin  always  lies  between  that  of  the 
hydrogen  and  the  fully  saturated  clay.  This  indicates  that  not  all  the  exchangeable 
ions  are  equally  effective  in  causing  this  reduction  of  specific  volume,  but  that  those 
which  can  be  held  in  a  fairly  acid  buffer  have  usually  a  greater  effect. 


Table 

III     (T 

-L)/B 

for  the  Non-Hydrogen  Clays 

(T 

-  W)/B  (HCl  -  N) 

(T  -  L)  B/(HC1  - 

-  N)  for 

B(HCI  -  N) 

Calcium 

X    lOr; 

A 

n 

/N    1V5 

Calcium 

Mag- 
nesium 

Sodium 

Potas- 
sium 

Amyl 
Alcohol 

Aniline 

Nitro- 
benzene 

Trias  (o) 

..       23-8 

31-3 

33-6 

19-0 

15-3 

19-5 

19-5 

17-7 

{h) 

..       23-8 

31-4 

— 

— 

— 

— 

. — 

— 

Rothamsted 

..       39-7 

31-4 

— 

22-6 

16-0 

— 

— 

— 

London  I  (a)     ... 

..      47-0 

27-2 

32-0 

16-8 

14-8 

23-2 

23-0 

16-0 

(h)     ... 

..      47-0 

30-1 

— 

— 

— 

— 

— 

20-6 

London  III 

..      49-2 

30-1 

33-8 

20-9 

17-6 

33-7 

23-0 

— 

Lower  Lias  Ser.  I. 

..       47-5 

35-6 

36-8 

29-0 

24-2 

— 

— 

— 

Ser.  II 

..       51-9 

30-0 

— 

— 

— 

— 

— 

13-7 

Carse      

..      48-1 

43-0 

— 

37-4 

26-7 

— 

— 

— 

Gloucester  Ser.  I. 

..       64-4 

— . 

— 

— 

— . 

— 

— 

15-6 

Ser.  II. 

fine 

..       68-4 

38-4 

— 

35-1 

— 

—   " 

— 

15-2 

Ser.  II. 

coarse 

..       69-3 

37-0 

— 

26-0 

— 

— 

— 

— 

Kirtou-in-Lind.sey 

..       70-6 

39-1 

— 

30-4 

— 

— 

23-9 

— 

Palestine 

..       91-2 

30-9 

31-8 

— 

— ■ 

26-3 

— 

19-1 

Fen        

..     124 

40-1 

— 

— 

— 

— 

— 

— 

Mean 


34-0 


33-6 


26-4 


19-1 


25-7 


22-3 


16-9 


The  mean  values  of  the  quantities  (T  —  L)/B(HC1  —  N)  for  all  the  hydrogen  clays  and 
all  the  basic  clays  nearly  in  equilibrium  with  the  pa  7  buffer  are  given  in  Table  IV. 
The  values  are  not  all  directly  comparable,  however,  since  the  means  are  derived  by 
averaging  over  different  clays.  For  example,  if  the  values  for  the  calcium  and  mag- 
nesiimi  are  compared  for  the  same  clays  in  water  the  values  are  Ca  =  31  •  0  and  Mg=:33  •  6, 
showing  that  magnesium  gives  a  definitely  larger  specific  volume  reduction  in  water 
than  calcium.  The  general  order  of  ion  effectiveness  is  independent  of  the  wetting 
liquid,  and  the  order  of  increasing  specific  volume  reduction  brought  about  by  one 
equivalent  of  exchangeable  ion  is 


K,  Na,  Ba,  H,  Ca,  Mg. 
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Table  IV — Mean  values  of  (T  —  L)/B  (HCl  —  N)  for  different  exchangeable  ions  and 

different  liquids 


Potassium 

Sodium 

Hydrogen 

Calcium 

Magnesium 

Water 

Amyl  Alcohol 
Aniline 
Nitrobenzene 

19-1  (6) 

11-3(1) 

9-5(1) 

7-3(1) 

26-4  (9) 
11-4(1) 
10-4(1) 

8-5  (2) 

31-4(20) 
23-7  (6) 
26-1  (4) 
13-9(12) 

34-0(14) 

25-7  (4) 
22-3(4) 
16-9  (7) 

33-6  (5) 
36-1  (2) 
33-4(1) 

20-0  (2) 

The  fi 

gures 

in  brackets  give 

the  number  of  clays  used  to  determine  the  mean 

value. 

Relative  to  Ca  =  ] 

[  for  each  liquid* 

Potassium 

Sodium 

Hydrogen 

Calcium 

Magnesium 

Water 

Amyl  Alcohol 
Aniline 
Nitrobenzene 

0-59 
0-33 
0-41 

0-46 

0-76 
0-35 
0-45 
0-49 

0-91 

0-86 
1-17 

0-84 

1 

1 
1 
1 

1-08 
1-21 
1-46 
1-14 

Surface    density 
charge 

of 

0-38 

0-55 



1 

1-48 

*  In  this  table  only  results  based  on  direct  comparisons  with  the  same  clays  have  been  used,  so  this 
table  cannot  be  derived  from  the  upper  half. 

With  the  exception  of  hydrogen  there  is  not  a  single  experiment  in  all  those  performed 
in  which  this  order  comes  out  significantly  different.  The  position  of  hydrogen  clay  in 
water  is  erratic,  for  although  on  the  average  its  effect  is  less  than  the  calcium,  in  two 
experiments  it  comes  above  calcium  and  in  two  below  sodium.  The  position  of  barium 
is  based  on  a  comparison  of  only  one  clay  saturated  with  different  exchangeable  ions. 
This  order  for  the  activeness  of  the  exchangeable  ions  in  interacting  with  water  has  also 
been  found  by  Pate*,  Baver,!  and  AndersonJ  who  measured  the  heat  of  wetting  of 
soils  and  clays  saturated  with  different  exchangeable  ions,  and  by  Janert§  who 
measured  the  heat  of  wettmg  of  two  of  the  separated  clays  here  described  (Trias, 
Lower  Lias),  so  that  magnesium  clays  do  seem  to  interact  with  water  or  other  polar 
dispersion  medium  more  actively,  and  potassium  clays  less  actively,  than  any  of  the 
other  ionic  clays  used. 

If  the  mean  values  of  (T  —  L)/B  (HCl  —  N)  for  the  different  exchangeable  ions  are 
calculated  relative  to  the  calcium  ion  for  the  four  liquids  given  in  Table  IV,  these  relative 
values  are  more  or  less  mdependent  of  the  liquid,  as  is  seen  in  the  lower  half  of  the  table  ; 
they  are  primarily  a  property  of  the  ion  and  not  of  the  ion-liquid  system.  These  relative 
values  could  probably  be  predicted  from  the  hypothesis  that  (T  —  L)/B  (HCl  —  N),  the 
effectiveness  of  the  ion  in  reducing  the  specific  volume  of  a  clay  in  a  polar  liquid,  is  propor- 
tional to  the  mean  surface  density  of  electric  charge  on  the  ion,  namely,  ze/4  Tir^,  where 

*  '  Soil  Sci.,'  vol.  20,  p.  329  (1925). 

t  '  J.  Amer.  Soc.  Agron.,'  vol.  20,  p.  921  (1928). 

i  '  J.  Agric.  Res.,'  vol.  38,  p.  565  (1929). 

§  '  J.  Agric.  Sci.,'  vol.  24,  p.  136  (1934). 
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r  and  ze  are  the  radius  and  the  electric  charge  on  the  ion.  This  hypothesis  cannot  be 
tested  directly  since  the  radii  of  the  ions  are  not  known,  but  if  the  radii  of  the  metal  atoms 
in  the  metal  crystal,  which  are  known  from  X-ray  data,  are  used,  a  test  can  be  made. 
In  the  last  row  of  Table  IV  are  given  the  surface  densities  of  charge  on  the  different  ions 
relative  to  calcium  calculated  in  this  way,  the  radii  of  the  atoms  used  in  this  calculation 
being  Mg  1  -61,  Ca  1-97,  Na  1-86,  and  K  2-25A.*  These  figures  are  seen  to  be  of  the 
same  order  of  magnitude  as  the  efiectiveness  of  the  three  metallic  ions  relative  to  calcium  ; 
thus  giving  some  support  to  the  hypothesis.  Exact  agreement  could  not  be  expected, 
since  the  liquids  will  interact  to  some  extent  with  the  negative  charges  on  the  clay 
surface,  and  in  the  above  calculation  this  interaction  has  all  been  attributed  to  the 
exchangeable  ions.  No  method  has  yet  been  used  to  measure  this  interaction,  which 
probably  varies  with  the  exchangeable  ion,  for  one  would  expect  that  the  larger  the 
proportion  of  the  exchangeable  ions  dissociated  from  the  clay  surface,  the  larger  will  be 
the  number  of  negative  charges  on  the  clay  surface  that  can  interact  with  the  molecules 
of  the  dispersion  medium.  Evidence  of  this  effect  can  be  detected  in  the  latter  half  of 
Table  IV ;  for  example,  the  sodium  and  jjotassium  figures  are  too  high  as  compared 
with  calcium  in  water,  owing  to  the  larger  dissociation  of  these  ions  from  the  clay.  There 
is  not  yet  sufficient  data  on  the  relative  dissociations  of  the  different  ions  from  clays 
dispersed  in  different  liquids  to  make  possible  a  quantitative  discussion  of  these 
discrepancies. 

The  conclusion  of  this  part  of  the  experimental  results  is,  therefore,  that  (T  —  L)  is 
controlled  by  three  main  factors  :  the  liquid  L  used,  the  amount  and  the  surface  charge 
density  of  the  exchangeable  ions  on  the  clay,  and  by  one  subsidiary  factor,  specifying 
to  some  extent  the  shape  of  the  buffer  titration  curve  of  the  clay  or  the  strength  of  the 
clay  acid.  This  factor  is  probably  connected  with  the  interaction  between  the  negative 
charges  on  the  clay  surface  and  the  molecules  of  the  wetting  liquid. 

The  Projyerties  of  the  Wetting  Liquid  Affecting  the  Specific  Volume  of  the  Clay 

A  few  experiments  were  made  with  other  polar  liquids  to  try  and  discover  what 
properties  of  the  wetting  liquid  are  connected  with  this  reduction  of  the  clay  specific 
volume.  For  the  liquids  considered  :  water,  amyl  alcohol,  anilme,  and  nitro-benzene, 
the  order  of  effectiveness  of  the  liquids  in  decreasing  the  specific  volume  of  the  clay 
is  nitrobenzene  least,  then  aniline,  then  amyl  alcohol,  with  water  greatest ;  and  this 
order  is  almost  independent  of  the  exchangeable  ion  on  the  clay,  as  is  seen  from  Table  IV. 
Hydrogen  clay  in  aniline  is  the  only  definite  exception,  though  possibly  magnesium 
clay  in  amyl  alcohol  may  be  one  also.  Magnesium  clay  in  aniline  is  not  one.  since  in 
the  only  comparison  available  (T  —  W)  comes  out  larger  than  (T  —  An).  The  hydrogen 
exception  is  definite,  as  is  seen  from  all  the  four  available  comparisons  of  the  hydrogen 
clays  with  calcium  clays  given  in  Tables  II  and  III,  and  is  due  to  a  specific  interaction 

*  Taylob,  •■  A  Treatise  on  Physical  Ctemistry,"  vol.  I,  p.  163  (1930). 

VOL.    CCXXXIII. — A.  3    D 


386  E.  W.  RUSSELL   ON  THE   INTERACTION   OF   CLAY 

between  the  hydrogen  clay  and  the  amine  group  of  aniline,  for  the  same  result  is  obtained 
if  o-xylidine  or  dimethyl  aniline  is  used  instead.  Thus,  for  the  separated  London  III 
clay,  the  reduction  of  specific  volume  from  the  tetralin  figure  of  the  hydrogen  and 
calcimn  clays  iu  these  three  amines  is 

Aniline  o-Xylidine  Dimethyl  aniline 

Hydrogen  clay       0-01410  0-01478  0-01295 

Calcium  clay  0-0n35  0-01008  0-00857 

Difference    0-00275  0-00470  0-00438 

Presumably  a  simple  chemical  action  is  involved  :  the  amine  group  opens  up  and  is 
adsorbed  to  give  a  substituted  ammonium  clay,  thus 


[C%|  -  H  +  NH^CeHj  ->  [Clag  -  NHj-CeHj. 


The  only  factor  affecting  the  liquid  order  so  far  recognized  is  chemical  action  of  the 
type  just  discussed.  The  liquid  order  is  neither  in  the  order  of  their  dielectric  constants 
nor  of  the  quantities  (x^/e  of  Ostwald,*  where  y.  is  the  dipole  and  e  the  dielectric  constant 
of  the  liquid,  though  with  the  exception  of  nitrobenzene  it  is  in  the  order  of  their  dipole 
moments  which  is  aniline  (1-55  x  10"^*  e.s.u.),  amyl  alcohol  (1-68),  water  (?  1-8), 
nitrobenzene  (3 -SO).!  It  is  not  possible  at  present  to  account  for  this  discrepancy  in 
the  order  of  nitrobenzene. 

Differences  in  dipole  moment  between  two  similar  liquids  whose  molecules  are  about 
the  same  size  can  nearly  quantitatively  account  for  the  different  reductions  of  the 
specific  volume  of  a  clay  in  them  below  the  clay's  specific  volume  in  tetraUn,  for  under 
these  conditions  the  reduction  iia  specific  volume  is  approximately  proportional  to  the 
dipole  moment  of  the  liquid.  As  an  example,  the  specific  volumes  of  two  calcium, 
two  magnesium,  a  hydrogen,  and  a  sodium  London  III  clay  were  determined  in  tetralin, 
amlme,  and  dimethyl  aniline,  and  the  ratios  of  the  reduction  in  the  specific  volume  in 
aniline  below  that  in  tetralin  to  the  reduction  in  dimethyl  aniline  were  : 

Hydrogen  clay  Magnesium  clays  Calcium  clays  Sodium  clay 

1-09  1-12  1-33  1-24 

1-08  1-35 

while  the  ratio  of  the  dipole  moments  of  aniline  and  dimethyl  aniline  is 


1-4    X  10-" 

=  1-11,  which  is  in  good  agreement  with  the  hydrogen  and  magnesium  figures,  but  not 
so  good  for  the  calcium  or  sodium,  though  the  sodium  figm'e  is  subject  to  a  large  experi- 
mental error.  A  further  comparison  was  made  of  the  specific  volumes  of  the  three 
calcium  clays.  Lias,  London  I,  and  Gloucester  in  the  liquids  nitrobenzene,  chlorbenzene, 

*  OsTWALi)  and  Ortloff,  '  Kolloidzschr.,'  vol.  59,  p.  25  (1932). 

t  The  dipole  moments  have,  wherever  available,  been  taken  from  Sutton,  '  Proc.  Roy.  Soc.,'  A,  vol.  133, 
p.  685  (1931). 
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benzyl  chloride,  and  benzyl  alcohol,  and  the  results  are  given  in  Table  V,  where  N,  CIB, 
BCl,  and  B  Ale  denote  the  specific  volumes  of  the  clays  in  these  four  liquids. 

Table  V — Comparison  of  the  Reduction  of  Specific  Volume  of  a  Clay  in  two  Liquids 

with  the  Ratio  of  their  Dipole  Moments 

T-N  T  -  BAlc  T  -  BAlc  T  -  BCl 


T-CIB 

T-BCl 

T-N 

T-CIB 

Ca-Lias      

Ca-London  I 
Ca-Gloucester        

3-13 

3-02 
1-09 

2-06 

1-20 
M4 
1-31 

1-26 

Ratio  of  the  dipole 

moments  of  the  liquids 

••     3-93  _  2. 
•■      1-56 

52 

1 
1 

4-«-«« 

1 

3 

4-»« 

The  determination  of  the  specific  volume  of  the  Lias  and  London  clays  in  benzyl  chloride 
failed  owing  to  a  violent  reaction  which  took  place  between  the  clays  and  the  benzyl 
chloride,  and  which  resulted  in  HCl  being  given  ofi  and  a  red  liquid  remaining.  It  is 
seen  that  using  the  liquids  nitrobenzene,  chlorbenzene  and  benzyl  chloride,  the  relative 
reductions  of  clay  specific  volumes  in  them  is  of  the  same  order  as  the  ratio  of  their 
dipole  moments,  but  the  liquid  benzyl  alcohol  lies  well  outside  this  grou25.  Thus  there 
appear  to  exist  sets  of  liquids  such  that  differences  between  the  reduction  of  the  specific 
volume  of  a  clay  in  any  two  liquids  in  a  given  set  can  be  accounted  for  nearly  quanti- 
tatively by  the  difference  of  dipole  moment  of  the  molecules  of  the  two  Liquids.  The 
liquid  property  that  determmes  to  what  set  a  given  liquid  shall  belong  is  not  the  mole- 
cular size,  for  benzyl  alcohol  and  benzyl  chloride,  for  example,  belong  to  different  sets, 
but  is  probably  the  comjjlex  property  of  the  liquid  that  determines  how  much  molecular 
association  takes  place,  and  how  much  capillary  condensation  the  liquid  will  suffer 
in  the  clay  crumbs. 

A  few  experiments  were  made  to  estimate  the  magnitude  of  the  capillary  condensation 
effects  with  different  liquids  by  comparing  the  specific  volume  of  the  clay  when  fully 
dispersed  in  the  liquid  with  that  of  the  dried  clay  crumb  wetted  with  the  liquid.  If 
(J,,  and  o,  are  the  specific  volumes  of  a  dispersed  clay  and  of  the  dry  clay  crumbs,  and 
if  nic  grams  of  liquid  suffer  capillary  condensation  per  gram  of  clay  crumb,  and  if  v^ 
is  the  mean  volume  of  the  condensed  liquid  and  v^  of  the  free  liquid,  then 

c!^  —  a,  =  ni,{v„~v,) (1) 

provided  that  (1)  the  crumbs  are  completely  wetted,  (2)  the  weight  of  imbibed  liquid  per 
gram  of  clay  is  the  same  for  both  systems,  and  (3)  in  the  determination  of  the  specific 
volume  the  clays  have  the  same  weight  of  residual  liquid  when  weighed  dry.  The 
effects  of  the  first  two  provisos  on  equation  (1)  are 

(1)  suppose  a  volume  Va  cc  of  air  is  entrapped  in  the  crumbs  per  gram  of  clay,  then 

a,i  —  ff,  =z  m,  {Vo  —  V.)  —  Va 

3  D  2 
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(2)  suppose  wi;  and  m';  grams  of  imbibed  liqiiid  are  held  per  gram  of  dispersed  and 
cnimbed  clay,  and  let  («„  —  S*')  be  the  mean  specific  volume  of  the  liquid  represented 
by  the  weight  {mi  —  m',),  then 

a,i  —  G,^  Mc  {v„  —  Vc)  —  (m;  —  m'i)Su 

showing  that  o^  or  ct,,  is  only  affected  by  the  imbibitional  liquid  if  there  is  an  appreciable 
change  of  volume  of  the  liquid  on  imbibition. 

The  chief  limitation  of  the  exjaerunents  performed  is  the  difficulty  in  determining 
the  weight  of  clay  in  the  dispersion.  The  weight  of  clay  crumbs  used  for  a  specific 
volume  determination  was  taken  as  the  weight  of  the  crimibs  after  they  had  been 
drying  in  a  desiccator  over  phosphorus  pentoxide  in  vacuo  for  about  three  weeks. 
The  weight  of  clay  used  in  the  water  dispersion  was  determined  by  removing  the  main 
bulk  of  the  water  by  evaporation  on  a  water  bath,  drymg  at  110°  C  for  24  hours,  and 
then  drying  over  phosphorus  pentoxide  in  vacuo  for  about  a  fortnight.  Under  these 
conditions  it  is  fairly  certain  that  the  moisture  content  of  the  two  samples  of  clay  wiU 
agree  within  a  few  tenths  of  a  milligram  per  gram  of  clay.  The  effect  of  this  difference 
can  easily  be  calculated,  for  if  the  dry  clay  from  the  dispersion  had  a  moisture  content 
m,  per  gram  of  clay  more  than  the  dry  clay  crumbs,  then 

<^d  —  <^c  =  m,_-  {v„  —  V,)  +  m,  {v,  —  a)  (approx.), 

where  a  is  the  specific  volume  of  the  clay.  Thus,  if  mr=  10^*  grams  and  ct  =  0-35, 
probably  a  fair  average  value,  then  a,,— a,  =  m,.{v^  — v,) -\- 3-5  X  10''\  Hence 
each  tenth  of  a  milligram  difference  of  moisture  content  per  gram  of  clay  between  the 
two  dry  clays  gives  a  difference  of  3-5  X  10^"  between  the  two  specific  volumes. 

The  results  of  the  comparisons  between  the  specific  volumes  of  deflocculated  clays 
in  water  and  of  their  crmnbs  in  water  are  given  in  Table  VI,  which  shows  that  o^  is 
usually  greater  than  a,,,  so  that  if  much  capillary  condensation  is  taking  place,  this 

Table  VI — Comparison  of  the  Specific  Volume  of  Deflocculated  and  Crumbled  Clay 

in  Water 


Specific  volume  of 

Specific  volume  of 

Difference 

deflocculated 

clay 

crumbed  clay 

(CTa  -  Oc) 

Od 

Ge 

X  10* 

Kirton-in-Lindsey  NaAc 

0-3647 

0-3656 

-  9 

NaN 

0-3605 

0-3602 

+  3 

Carse  Na           

0-3792 

0-3800 

-  8 

London  I  H 

0-3547 

0-3570 

-23 

Ca 

0-3536 

0-3547 

-11 

effect  is  more  than  counterbalanced  by  incomplete  wetting  of  the  crumbs,  or  to  a  smaller 
imbibition  of  water  by  the  crumbled  clay  than  by  the  dispersed  clay,  or  to  incomplete 
drying  of  the  clay  before  wetting.  Since  none  of  the  last  three  factors  can  be  very 
large,  only  a  little  capillary  condensation  can  be  taking  place. 
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Turning  to  the  results  of  this  comparison  for  non-aqueous  dispersions,  the  technique 
was  very  unsatisfactory,  since,  although  the  results  showed  that  appreciable  capillary 
condensation  appeared  to  take  place  if  the  dried  clay  crumbs  were  wetted  with  ethyl  or 
amyl  alcohol,  two  sources  of  uncertainty  enter  rendering  the  results  doubtful.  In  the 
first  case  the  density  of  the  free  liquid  in  the  clay  dispersion  always  had  a  higher  density 
than  the  pure  liquid,  and  in  the  second  the  clay  crumbs  were  weighed  dry  before  being 
wetted  with  the  organic  liquid,  while  the  dispersed  clay  was  weighed  after  it  had  been 
dried  from  the  liquid,  so  that  errors  of  one  or  two  milligrams  per  gram  of  clay  could 
easily  arise  between  the  comparative  weights  of  the  same  clay  sample,  due  to  different 
residual  liquids  on  the  dry  clay.  Thus,  although  capillary  condensation  probably 
takes  place  in  clay  crumbs  in  organic  liquids,  the  results  give  no  certam  proof  that  this 
is  so. 

The  author  is  indebted  to  Dr.  B.  A.  Keen  and  also  to  Dr.  R.  K.  Schofield  for  many 
helpful  suggestions  durmg  the  course  of  this  work. 

Summary 

Clay  particles  can  form  strong  aggregates,  or  crumbs,  when  dry,  only  if  the  clay 
particles  are  sufficiently  small,  if  there  are  a  sufficient  number  of  small  exchangeable 
ions  on  the  clay,  and  if  the  clay  has  been  dried  from  a  dispersion  medium  whose  molecules 
are  polar  and  sufficiently  small. 

Individual  clay  particles  have  an  appreciable  interaction  with  the  molecules  of  polar 
liquids  ;  they  do  not  iateract,  or  only  slightly  with  non-polar  liquids.  On  the  hyj^othesis 
that  no  differential  interaction  occurs  between  the  exchangeable  ions  on  the  clay  and 
a  non-polar  hydrocarbon,  the  magnesium  ions  on  the  clay  have  about  seven  and  the 
calcium  ions  about  three  more  molecules  of  water  of  hydration  than  the  sodium  or 
potassium  ions.     Independent  evidence  is  given  for  this  conclusion. 

Clays  interact  with  water  and  organic  liquids  containing  a  polar  group  by  causing 
a  contraction  in  volume  of  the  liquid.  For  homionic  clays,  saturated  up  to  p^  7  with 
base,  this  contraction  is  nearly  proportional  to  the  number  and  mean  charge  density 
on  the  surface  of  the  exchangeable  ions  on  the  clay  and,  to  a  less  extent,  it  varies 
with  a  parameter  specifying  the  shape  of  the  titration  curve  of  the  clay. 

These  results  have  been  interpreted  on  the  hyjiothesis  that  cations  can  orientate  polar 
molecules  of  the  dispersion  liquid  aromid  them,  and  this  power  is  proportional  to  their 
surface  density  of  charge.  This  power  is  also  possessed  by  the  free  negative  charges 
on  the  clay  particle.  When  the  dispersion  liquid  has  nearly  all  been  removed  the  cations 
bind  the  negative  charges  on  two  clay  particles  together  by  means  of  bridges  of  strongly 
orientated  molecules  of  the  polar  dispersion  liquid. 


Printed  in  London,  Great  Britain,  by  Harbison  and  Sons,  Ltd.,  St.  Martin's  Lane,  W.C.2. 
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